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Abstract 

We investigate the general structure of optimal investment and consumption with small 
proportional transaction costs. For a safe asset and a risky asset with general continuous dy- 
namics, traded with random and time-varying but small transaction costs, we derive simple 
formal asymptotics for the optimal policy and welfare. These reveal the roles of the investors' 
preferences as well as the market and cost dynamics, and also lead to a fully dynamic model for 
the implied trading volume. In frictionless models that can be solved in closed form, explicit 
formulas for the leading-order corrections due to small transaction costs obtain. 
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1 Introduction 

Classical financial theory is built on the assumption of perfectly liquid markets. If prices follow 
continuous-time diffusions, then this also holds for most optimal trading strategies [49t I5U|. As a 
result, frictionless models typically prescribe incessant trading, which is unfeasible with even the 
slightest market imperfections. Proportional transaction costs represent one such friction present 
even in the most liquid financial markets in the form of bid-ask spreads. The study of their impact 
on portfolio choice was initiated by the seminal papers of Constantinides and Magill [44| ITO] , as well 
as Dumas and Luciano [15jj^ In infinite-horizon models with constant risk aversion, transaction 
costs, and investment opportunitiesj^ they argued that it is optimal to refrain from trading until 
one's position leaves a so-called "no-trade region" of constant width around the frictionless target. 
The corresponding welfare effect of transaction costs was found to be small, as "the derived utility 
is insensitive to deviations from the optimal portfolio proportions, and investors accommodate large 
transaction costs by drastically reducing the frequency and volume of trade" [lOj. Put differently, 
investors whose only motive to trade is rebalancing towards a constant frictionless target after 
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substantial price drops or rises do not suffer severely from a wider bid-ask spread if they adjust 
their trading strategies accordingly. 

In the last decades, the substantial decline of bid- ask spreads across financial markets has 
sparked a huge increase of trading volume, and spurred the continued growth of high-frequency 
trading. The corresponding high-volume strategies naturally require a careful assessment of the 
trading costs they generate when reacting dynamically to various signals from the market. Ac- 
cordingly, there has been growing interest in portfolio choice with transaction costs and stochastic 
opportunity sets, in financial economics [3l HSl |33l [251 [181 19] , mathematical finance [571 [27], aiid also 
in the industry practice of quantitative finance |l8l [171 [37] . Broadly speaking, these studies either 
employ numerical methods, or study the practically relevant limiting regime of small transaction 
costs to shed more light on the salient features of the solution. 

The present study extends and unifies the results of the second strand of research, by abstracting 
from concrete models and unveiling the general structure of portfolio choice with small transaction 
costs. We provide simple asymptotic formulas for approximately optimal trading strategies as 
well as the corresponding welfare and implied trading volume in very general settings]^ To wit, 
we consider investors with general preferences over intermediate consumption and terminal wealth, 
who receive a random endowment stream and trade a safe and a risky asset with general Ito process 
dynamics in the presence of random and time- varying but small transaction costs. Even in this 
generality, the structure of the solution turns out to be of an unexpectedly simple form. 

With small costs, investors should keep their holdings in a time- and state dependent no-trade 
region. The latter is generally not centered around the frictionless target, because past transaction 
costs reduce investors' wealth. However, the optimal adjustment is the simplest one conceivable: 
investors just change their target position (and also their optimal consumption) exactly as they 
would in the frictionless case to account for their reduced wealth. The halfwidth ANTt of the 
optimal no-trade region is given by the cubic root of three factors, stemming from the width of the 
spread, the optimal frictionless strategy, and the investors' preferences, respectively: 



are not hedged at the leading order. Concerning the frictionless optimal strategy ipt, the crucial 
quantity turns out to be its local quadratic variation d{ip)t normalized by the one d{S)t of the 
market, i.e., the ratio of squared diffusion coefficients. The basic tradeoff is that more active target 
strategies require wider buffers to save transaction costs, whereas turbulent market times call for 
closer tracking to limit losses due to displacement from the target portfolio. The final ingredient 
for the width of the no-trade region is the risk tolerance Rt of the investors' indirect utility, which 
subsumes their preferences by weighting the relative importances of current and future consumption 
streams against each other|^ 

The corresponding utility loss due to small transaction costs can also be quantified. At the 
leading order, it is given by the squared halfwidths of future no-trade regions, suitably averaged 

^Corresponding results for the Black-Scholes model have been obtained by Shreve and Soner Whalley and 
Wilmott [62], Janecek and Shreve [2^, and in many more recent studies. Using formal perturbation arguments, 
Martin and Schoneborn [481 147j study local utility maximizers, and the companion paper of the present study [27] 
deals with exponential investors in a general setting. Soner and Touzi [^ as well as Possamai', Soner and Touzi [52j 
study infinite-horizon consumption models with general utilities in a multidimensional complete market, and provide 
verification theorems based on the theories of viscosity solutions and homogenization. 

^Without intermediate consumption, the special case of a risk-tolerance wealth process also plays a key role in the 
work of Kramkov and Sirbu (34i ,35. i36j on utility-based prices and hedging strategies for a small number of claims. 
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Small spreads only enter through their current halfwidth et, i.e., the dynamics of future costs 
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with respect to both time and states. Here, time is measured in business time, i.e., a clock that runs 
at the speed of the market's local variance: losses due to trading costs accrue more rapidly in times 
of frequent price moves. Averaging across states is performed under the investors' marginal pricing 
measure, i.e., the impact of the small costs is priced using the frictionless investors' marginal pricing 
rule. The key determinants for the welfare loss caused by small transaction costs are again the 
width of the spread, the investors' indirect risk tolerance, and the activity rate or portfolio gamma 
d{ip)t/d{S)t of the frictionless target strategy and the market. The portfolio gamma can therefore 
be interpreted as a sensitivity with respect to market liquidity: passive investors with relatively 
inactive strategies are insensitive to changes in the spread, in stark contrast to more active traders. 
As observed by Rogers [53], the utility loss due to small transaction costs is composed of two parts: 
on the one hand, there are the direct costs incurred by actual trades. On the other hand, there 
is the displacement loss due to having to deviate from the frictionless target position. For small 
costs, we find that the relative sizes of these two contributions are universal, irrespective of asset 
and cost dynamics, and investors' preferences: transaction costs always contribute two thirds of the 
leading-order certainty equivalent loss, whereas the remaining one third is caused by displacement. 
For a small Tobin tax [60], this implies that two thirds of the welfare lost by investors is paid out 
in taxes. The remaining one third dissipates due to suboptimal portfolio composition. 

Our results also lead to a tractable model for trading-volume dynamics. This is one area where 
frictionless models fail dramatically, leading to infinite turnover on any time interval. Models with 
proportional transaction costs lead to finite trading volume. Yet, they also do not model the 
turnover generated by a representative investor in a realistic manner, since they prescribe trading 
of "bang-bang" type: the investor either does not trade at all, or at an infinite rate. For small costs, 
however, we find that turnover can be approximated by a finite rate at the leading order, in line 
with the models typically used in the price impact literature [21 [T8] [22] . Implementing frictionless 
strategies with a constant buffer leads to trading volume proportional to the quadratic variation 
of the target. In contrast, the turnover generated by optimal implementation is determined by a 
geometric average of the local variabilities of both the frictionless target strategy and the market, 
scaled by risk tolerance. This increasing relation with market volatility is in line with the empirical 
findings documented by Karpoff |31] . With constant investment opportunities, constant relative risk 
aversion implies a constant turnover rate, whereas stochastic opportunity sets driven by stationary 
factors lead to stationary models for relative share turnover, allowing to reproduce empirical stylized 
facts such as mean reversion and autocorrelation (cf. Lo and Wang [40j). At the leading order, 
turnover is inversely proportional to the cubic root of the spread, irrespective of the latter's future 
dynamics. Ceteris paribus, the model therefore predicts that reducing an already small spread by 
10% should increase turnover by about 3.6%, irrespective of preferences as well as asset price and 
cost dynamics. 

The above results on utility maximization extend to other widely-used optimization procedures, 
such as mean- variance portfolio selection in the spirit of Markowitz |45j and the Kelly criterion |32j 
of maximizing the long-run growth rate. 

As in the frictionless case, the mean-variance optimal portfolios are obtained by rescaling the 
optimal strategy for (truncated) quadratic utility. Obtaining a given target return with transaction 
costs requires a larger multiplier and a bigger variance of the portfolio. Conversely, in the presence 
of transaction costs, a given variance bound leads to a smaller multiplier and a reduced return. Both 
of these effects are magnified if an ambitious target return resp. loose variance bound prescribes 
large positions in the risky asset. Nevertheless, the corresponding Sharpe ratio remains universal 
among all mean-variance optimal portfolios: it is simply decreased by a constant to account for the 
presence of a nontrivial spread. Here, at least two thirds of this welfare effect are caused directly 
by trading costs, whereas at most one third is due to displacement from the frictionless target. 
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In the absence of frictions, it is well known that the optimal portfolio for logarithmic utility 
maximizes the long-term growth rate not only in expectation, but also in an almost-sure sense. 
For small costs, we establish that this remains true at the leading order. As in the frictionless 
case, the (approximately) growth-optimal portfolio turns out to be myopic, in that it is determined 
completely by the local dynamics of the model. An explicit formula for the leading-order reduction 
of the long-run growth rate is also provided. 

The remainder of the article is organized as follows. Section [2] introduces the model and collects 
the inputs from the frictionless investment/consumption problem needed to formulate the leading- 
order corrections for small costs. Subsequently, the main results are presented and discussed in 
Section [3j The important special case of investors with constant relative risk aversion is treated 
in Section [4} Afterwards, we turn to mean- variance portfolio selection and the growth-optimal 
portfolio with transaction costs, before concluding in Section [7j The derivations of all results are 
collected in Appendices [ApH These are based on applying formal perturbation arguments to the 
martingale optimality conditions of a frictionless "shadow price" [IH HT} [26] , which yields the same 
optimal strategy and utility as the original market with transaction costs. 



2 Preliminaries 
2.1 Setup 

Consider a financial market consisting of a safe asset with price normalized to on^ and a risky 
asset, traded with small proportional transaction costs et > 0- This means purchases of the latter 
are carried out at a higher ask price St + St, whereas sales only earn a lower bid price St — St- 
Put differently, et is the halfwidth of the bid-ask spread. The mid price St is assumed to follow a 
general, not necessarily Markovian, Ito process: 



dSt = b^dt + yf^dWt, 



for a standard Brownian motion Wt- In this setting, an investor trades to maximize expected utility 
from consumption and terminal wealthj^ 



W{x) = sup E 



T 

Uiit,kf)dt + U2iX^i'lp',k')) 







over all consumption rates kf and trading strategies ipf with associated wealth processed 

Xf(V^^F) = x+ l^ldSs- fklds + ^t- fesdWWs. 

Jo Jo 







Here, the first two integrals describe the usual frictionless gains from trading and consumption 
expenditures, respectively. The third term represents the investors' cumulative endowment process, 
which can include both a continuous component, such as labour income, and lump-sum payments, 
such as an option position maturing at the terminal time T. Finally, the last integral accounts for 



^ Since we consider general state-dependent utilities, the safe asset can be normalized without loss of generality. 
Indeed, for an arbitrary safe asset St > 0, one can reduce to this case by using the latter as the numeraire and 
maximizing utility from discounted consumption and terminal wealth for the utilities ui{uj,t,x) — ui{uj,t,xSt{<-o)) 
and U2{lo,x) = 1x2(1^, xS^ (a;)). 

^Here, ui{uj,t,x) and U2{lo,x) are increasing, concave utility functions in wealth x, depending on time t and the 
state cj in a nonanticipative manner. 

'^||(;A^||t denotes the total variation of (j)"^ , measuring the number of shares traded on [0,*]. 
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the transaction costs incurred by the investors' strategy, by weighting the total variation of the 
latter with the width of the spread. 

In most of the portfolio choice literature dating back to Constantinides and Magill |44j , trans- 
action costs equal a constant fraction of the monetary amount transacted, et = eSt- For highly 
liquid stocks spreads often equal a few ticks, regardless of the stock price, so that a constant spread 
Et = £ may be a more plausible model. Then, transaction costs are levied on the number of shares 
traded as in the futures model of Janecek and Shreve |24j . In general, the dynamics of the spread 
turn out to be inconsequential, as long as it follows an Ito process £t rescaled by a small parameter 
e, i.e., £t = eSt- Henceforth, Et refers to a process of this form without further mention. 



2.2 Inputs from the Prictionless Problem 

In this section, we collect the inputs from the frictionless problem that determine the leading-order 
corrections due to the presence of small transaction costs. 

Denote by Kt, (ft, and Xt{ip, k) = x+ Jq (psdSs — jQ Ksds+'^t the frictionless optimal consumption 
rate, trading strategy, and wealth process, respectively, and write Q for the corresponding marginal 
pricing measurej^ Moreover, let U{t,x) be the investors' indirect utility, i.e., the maximal utility 
that can be obtained on [t, T] starting from wealth x, by trading according to the conditionally 



optimal portfolio/consumption pair {(fsit, x) , Hsit, x))s^[t x] (cf- Equation (B.l) in the appendix). 
With this notation, define the frictionless sensitivities of consumption and investment with respect 
to wealth a^ 

, Kt{t,Xt + 6) - Kt{t,Xt) , ift{t,Xt + 6)-yDt{t,Xt) 

= M A ' = A • 

(5— 5>0 5— >0 

In the spirit of Kramkov and Sirbu |34l [35] , these quantities describe how marginal changes in the 
investors' wealth influence their consumption and investment decisions in the absence of frictions, 
i.e., how much of an extra dollar should be consumed or invested, respectively. 

Finally, the investors' preferences are subsumed by their direct risk tolerance with respect to 
current consumption, 

_ U[{t, Kt) 
Ui{t,Kt) 

as well as their indirect risk tolerance 

U'it,Xt) 

u"{t,xty 

that is, the risk tolerance of the indirect utility, evaluated along the optimal frictionless wealth 
process, The indirect risk tolerance Rt measures the investors' attitude towards future risk: 

(i) If the market is complete or the investors' preferences are described by a standard utility func- 
tion of exponential or power type, then the indirect risk tolerance is given by the conditional 
expectation 

'•^ u[{s,Ks)r u'2{Xt{^,k))' 



Rt = 



-ds 

s,Ks) u'^{Xt{(P,k)) 



(2.1) 



*That is, the dual martingale measure, which is linked to the primal optimizers by the usual first-order conditions 
(see Appendix[c]as well as [30] and the references therein for more details). As observed by Davis 13 , Q-expectations 
describe the investors' pricing rule for a marginal number of contingent claims, whence the name "marginal pricing 
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Here and henceforth, primes always denote derivatives with respect to current wealth. 
^°In a Markovian setting, this is the risk tolerance of the value function, evaluated at the optimal wealth process. 
This object already is central in the frictionless analysis of Merton [491 150j . and also features prominently in the 
recent work of Soner and Touzi |57| . 
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That is, Rt represents the investors' expected risk tolerance with respect to future consump- 
tion and terminal wealth, computed under the marginal pricing measure Q. 



(ii) Beyond complete markets and standard utility functions, (2.1) remains valid if the investors' 
marginal pricing measure Q is replaced with some other equivalent martingale measure Q 
(see Appendix [B|) ■ Even though the latter implicitly depends on the indirect risk tolerance, 
one can therefore still interpret Rt as an expectation of future risk tolerances with respect to 
consumption and terminal wealth. 

(iii) Alternatively, the indirect risk tolerance Rt can also be characterized dynamically in terms 
of the quadratic backward stochastic differential equation ( |B.15 ) . The sensitivities and ip't 



can in turn be expressed in terms of rj and Rt (cf. (B.16)). 



(iv) If the investor focuses exclusively on utility from terminal wealth or from intermediate con- 
sumption, then the above formulas remain valid, setting = or Rt = 0, respectively. 

3 Main Results 

With the inputs from the frictionless problem at hand, the impact of small proportional transaction 
costs Et > on optimal investment and consumption policies, welfare, and implied trading volume 
can now be quantified as follows. 

3.1 Optimal Investment and Consumption 

With small transaction costs et > 0, it is approximately optimaj^ to consume at rate 

Kl = Kt + < (Xf k') - Xtiif, k)) , (3.1) 

while engaging in the minimal amount of trading necessary to keep the number ipf of risky shares 
within the time and state dependent no-trade region [NT^ — ANTj,NTj + ANTj] with midpoint 

WTt = ipt + ^'t {Xti^', k') - Xt{^, k)) (3.2) 

and halfwidth 



SRt d{ip) 
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A first crucial observation is that the future dynamics of a stochastic spread are disregarded through- 
out at the leading order; only its current width is taken into account. Let us discuss the other 
characteristics of this policy in more detail: 

(i) Small transaction costs only influence consumption by affecting the investors' wealth. In view 



of (3.1), the optimal rate is simply adjusted according to the corresponding sensitivity k[ of 
the frictionless optimizer and the change Xf{if^, k^) — Xt{ip, k) in wealth caused by applying 
the policy {ipt, Kf) with frictions rather than (ipt, i^t) without these. The sensitivity coefficient 
can be written as n't = rt/Rt (cf. Appendix |b]) ; hence, it is strictly positive and trades off the 
relative importance of present and future consumption streams. For large rt/Rt, investors are 
less concerned about changes in their current consumption level than at later times. Hence, 
they are willing to deviate substantially from the frictionless target to react to changes in 



^That is, the utility obtained from this policy is optimal at the leading order 0{e^^^) for small costs et = e£t 



6 



their wealth. If fluctuations in current consumption are deemed relatively more important 
than at later times, the situation is reversed. The optimal wealth with transaction costs 
is typically smaller than its frictionless counterpart. Therefore, small trading costs tend to 
reduce consumption accordingly. The greatest reductions occur at those times where investors 
are most tolerant with respect to changes in their consumption level. 

(ii) The interpretation for the midpoint NT^ of the no-trade region is similar: it is shifted com- 
pared to the frictionless position (pt to account for the wealth effect of past transaction costs. 
These change the investors' optimal wealth at time t from Xt{ip,K) to Xf{ip^,K'^), and the 
target position is adjusted accordingly, reminiscent of a Taylor expansion holding all other 
variables fixed. The scaling factor is the sensitivity ip[ = d{R, S)t/ Rtd{S)t, determined by 
the local dynamics of the indirect risk tolerance Rt and the asset price St (cf. Appendix |B]). 

(iii) The halfwidth ANT^ of the no-trade region is given by the cubic root of three factors p| 
The term et corresponds to the absolute halfwidth of the bid-ask spread. Larger frictions 
necessitate wider inactivity regions, regardless of their future dynamics. The factor 2>Rt/2 
reflects the investors' tolerance to risk. Ceteris paribus, more risk tolerant investors are willing 
to accept larger deviations from their frictionless target in order to save transaction costs. 
Finally, the portfolio gamma d{(p)t/d{S)t trades off the local activity rates of the frictionless 
optimal strategy and the market. Tracking highly oscillatory targets requires wide buffers to 
save transaction costs. Conversely, wildly fluctuating asset prices necessitate close tracking 
to reduce losses due to displacement from the frictionless position. 

(iv) For utility from terminal wealth only = 0), the formulas for the optimal trading 
strategy remain valid. Consumption is of course null in this case. Conversely, the pure 
consumption case {u2{x) = 0) is also covered by the above formulas. 

In summary, the adjustment of the leading-order optimal policy due to small transaction costs 
is myopic in the sense that it only depends on non-local quantities associated to the frictionless 
optimization problem, namely the investors' frictionless optimal policy and risk-tolerance wealth 
process. Even if the frictionless optimizer includes intertemporal hedging terms in models with 
stochastic opportunity sets, the effect of small trading costs is purely local: these only enter through 
the current width of the spread and the investors' frictional wealth. 

3.2 Welfare 

The welfare effect of transaction costs across different models and preference structures is most 
easily compared in terms of certainty equivalents. To this end, let U^{x) and U{x) denote the 
maximal utilities that can be obtained starting from initial capital x, with and without transaction 
costs Et = e£t, respectively. Then, at the leading order 0(e^/^): 



U^x) ^u(x-EQ ^-^^d{S)t] ) . (3.4) 



Hence, the above Q-expectation represents the amount of initial capital the investor would be ready 
to forgo to trade the risky asset without transaction costs, i.e., the certainty equivalent loss due to 
small frictions. This leading-order optimal performance is attained by the consumption/portfolio 



pair from Section 3.1 Let us discuss some of the implications of this result: 



^For local utility maximizers with constant risk tolerance, an analogous result has been obtained by Martin |47| . 
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(i) In view of (3.4), the certainty equivalent loss due to small transaction costs is determined by 
the future squared halfwidths of the optimal no-trade region, suitably averaged with respect 
to both time and states. Here, time is measured in terms of business time d{S)t, i.e., with 
a clock that runs at the speed of the market's local variance. As a result, losses due to 
transaction costs accrue more rapidly in times of frequent price moves. Averaging across 
states is performed under the marginal pricing measure Q: the impact of small costs is priced 
according to the frictionless investors' marginal pricing rule. 

(ii) The squared halfwidths of the no-trade region are normalized by the investors' risk tolerance 
Rt- The interpretation is that more risk-tolerant investors are less willing to give up initial 
endowment to get rid of the extra risks induced by future frictions. 

(iii) As observed by Rogers ^1], the welfare effect of transaction costs is composed of two parts. 
On the one hand, there are the direct costs incurred by trading. On the other hand, there is 
the displacement effect of having to deviate from the frictionless optimizer. At the leading 
order, the relative magnitudes of these two effects are universal (cf. Appendix [E|) : two thirds 
of the welfare loss are caused directly by trading costs, whereas the remaining one third is due 
to displacement. Remarkably, this holds true irrespective of asset price and cost dynamics, as 
well as the investors' preference structure. For a small Tobin tax, this implies that two thirds 
of the corresponding certainty equivalent loss actually correspond to tax payments, whereas 
the remaining one third dissipates due to suboptimal portfolio composition. 



(iv) Fixing the investors' risk tolerance. Formulas (3.3) and (3.4) show that the activity rate 
or portfolio gamma d{(p)t/d{S)t of the frictionless optimizer and the market determines the 
impact of a non-trivial spread et- The portfolio gamma d{(p)t/ d{S)t therefore quantifies the 
investors' exposure to "liquidity risk" In complete markets, this notion reduces to the usual 
"gamma" of the portfolio (cf. the discussion in [27]), in line with the widespread interpretation 
of the latter as a sensitivity with respect to trading costs (see, e.g., [SJ Section 9.3]). 



3.3 Implied Trading Volume Dynamics 

One severe shortcoming of frictionless diffusion models is that they lead to the absurd conclusion 
that the number of shares transacted is infinite on any finite time interval. This makes it difficult 
to draw conclusions about the "trading volume" generated by a given policy. As a remedy, one can 
turn to the culprit of this phenomenon, namely the Brownian component of a diffusion strategy (pt, 
and measure its activity in terms of its local quadratic variation d{(j))t- But this notion of trading 
volume is ad hoc, and it is unclear how to relate it to the notions of share and wealth turnover 
prevalent in the empirical literature (cf., e.g., pfU]). 

Models with transaction costs present an appealing alternative, leading to finite turnover. How- 
ever, a corresponding representative investor also does not match the trading volume observed in 
real markets: the resulting trading schemes are of "bang-bang" -type, i.e., volume is either zero (in 
the no-trade region) or trades take place at an infinite rate (when the boundaries of the no-trade 
region are breached). Yet, as spreads decline, the trading times become more and more frequent. 
At the leading order, the corresponding turnover can then be approximated by a finite rate, in 
line with the models typically used in the price impact literature (e.g., [2| 1181 [22] ). The resulting 
formulas identify the quadratic variation of the frictionless target strategy as the trading volume 
corresponding to suboptimal implementation with a no-trade region of constant width. In contrast, 

^^It is important to emphasize that "hquidity" only refers to the width of the bid-ask spread here, and not to other 
proxies such as the ones proposed by, e.g., Acharya and Pedersen [T]. 
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optimal rebalancing leads to a turnover rate depending both on both the fluctuations of the target 
and the market. 

To make this precise, consider a generic frictional strategy (j)^ prescribing the minimal amount 
of trading necessary to remain inside a symmetric no-trade region (pt ± around a frictionless 
diffusion strategy 4>t- For tight tracking (A^ ~ 0) the corresponding absolute share turnover is then 
given by (see Appendix [F]) : 



At the leading order, the turnover rate is therefore determined by the ratio of the local fluctuations 
d{(f))t of the frictionless diffusion being tracked, and the width A^ of the no-trade region around it. 
Tracking a more active strategy generates higher turnover, whereas using a wider buffer decreases 
the required trading volume. If a buffer with constant width is used, the trading activity generated 
by tracking the diffusion strategy (j)t is indeed determined by the quadratic variation of the latter, 
up to a constant. This justifies the use of this quantity as a measure of trading activity, but only if 
frictionless strategies are implemented suboptimally by using a no-trade region of constant width. 



For the approximately optimal strategy (fl from Section 3.1, the general formula (3.5) takes the 
following form: 

Measured in business time, the turnover rate corresponding to tracking the frictionless target in an 
approximately optimal manner is therefore determined by the cubic root of the following inputs: 

(i) The inverse of the cubic root of the absolute halfwidth £t of the bid-ask spread. Smaller 
spreads allow to keep narrower buffers and therefore lead to increased turnover. 

(ii) The investors' risk-aversion process 1/Rt- Ceteris paribus, more risk averse investors track 
the frictionless target more tightly to reduce displacement losses, thereby generating higher 
turnover. 



(iii) The portfolio gamma d{(f)t/d{S)t, already encountered in Sections 3.1 and 3.2, following a 
quickly moving target requires more adjustments. 

In calendar time, the leading-order optimal trading rate is determined by the geometric average 
d{ip)'^^^d{S)y^ , scaled by risk tolerance and the spread. In particular, the turnover generated by the 
optimal implementation of a frictionless strategy in the presence of small transaction costs depends 
not only on the activity of the frictionless target, but also on the fluctuations of the market. This 
is in line with the empirically observed positive relationship between volume and volatility [|31J. 



Formula (3.6) also leads to a fully dynamic model for turnover in the presence of a random and 
time-varying spread. Since our model allows for random endowments, it applies to diverse types 
of investors, ranging from mutual funds rebalancing to maximize their long-run growth rate, high- 
frequency traders reacting dynamically to various signals from the market, to option desks hedging 
their exposure to derivative securities written on the risky asset. In each case, the resulting turnover 
only depends on the spread through the cubic root of its current width. All other things being 
equal, the model therefore predicts that reducing a small spread by 10% should increase volume by 
about 3.6%, regardless of asset and cost dynamics, preferences, and different investor types. This 
matches quite well with the empirical estimates of Epps |T7j . A thorough econometric analysis of 
the model's testable implications is a challenging direction for future research. 
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4 Constant Relative Risk Tolerance 



The simplest special case of the above general results is given by investors with exponential utilities. 
Their absolute risk tolerances with respect to intermediate consumption and terminal wealth are 
constant, leading to a deterministic indirect risk tolerance even with a random endowment stream 
(see Appendix [B]) . This setting and applications to utility-based pricing and hedging are discussed 
at length in the companion paper of the present study [27J. 

In this section, we specialize the general results of the previous section to the specification 
most widely used in the literature on portfolio choice, namely isoelastic utilities with constant 
relative risk tolerance I/7 > 0. More specifically, fix an impatience rate 5 > 0, a scaling factor 
/? > 0, and suppose the utilities from consumption and terminal wealth are both of power type, 
i.e., ui{t,x) = I3e^^'^~^'^x^^''' /{I — 7) and U2{x) = x^^"'/{l — 7), resp. ui{t,x) = (3e^^'^~^^ logx and 
iJ'2ix) = logx for 7 = 1. Without random endowments {^t = 0), constant relative risk tolerance 
implies that the investors' indirect risk tolerances are given by constant multiple of their optimal 
wealth processes (cf . Appendix |B]) : 

Rt = Xt{^,K)/^. 

As in the frictionless case, all quantities of interest are therefore most conveniently expressed in 
relative terms To wit, write the dynamics of the risky asset in terms of returns, dSt/St = dYt, and 
express portfolio/consumption pairs in terms of the risky weighlj^vrf = ipfSt/X^{ip^,K'^) and the 
consumption/wealth ratio cf = nf / Xf {^p^ , k^) . Finally, parametrize transaction costs as fractions 
of current stock prices, St = "qtSt- With these notions, the wealth effect of past transaction costs 



is absorbed, so that (3.1-3.3), (3.4), and (3.6) lead to particularly simple formulas for the optimal 



policy as well as the associated welfare and trading volume. 
4.1 Optimal Investment and Consumption 

For investors with constant relative risk tolerance, it is approximately optimal to consume the 
same fraction of current wealth as in the frictionless case, cf = q, and engage in the minimal 
amount of trading necessary to keep the risky fraction vrf within a symmetric no-trade region 



[vTt — AvTf, TTt + Avr^] around the frictionless target weight vrt. In view of (3.3 ), the halfwidth Avr^ is 



given explicitly in terms of the local dynamics of vrt and the return process dYt = dSt/Sf. 

Several remarks are in order here: 

(i) The investors' optimal consumption/wealth ratio is unaffected by the presence of small fric- 
tions. Whereas the adverse effect of the transaction costs tends to reduce the absolute level 
of consumption, the fraction of current wealth to be consumed remains the same. 

(ii) Up to the factor 3r]t/2j, which only depends on the relative spread ijt and the investors' 
preferences but not on the asset dynamics, the half- width of the no-trade region is completely 
determined by the frictionless optimal risky fraction ttj as well as the joint dynamics of the 
latter and the return process Yt . In models that can be solved in closed- form in the absence of 



^*As is well known, random endowments generally destroy the homotheticity of the problem and therefore rule out 
the simplifications offered by this change of parametrization. 

^^At the leading order, it does not matter whether this fraction is evaluated at the bid-, ask-, mid-, or any other 
price process taking values in the bid-ask spread. 
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frictions (e.g., Il9l[50l[33l|6Tl|8l[39]), this immediately yields explicit formulas for the optimal 
no-trade regions with small costs. 

The first term, 7r|(l — vrt)^, already arises in the Black-Scholes model |23tBl[TU]. It corresponds 
to the buffer necessary to account for changes in the risky fraction due to price moves of the 
risky asset. Accordingly, this term vanishes for vrj = or vr^ = 1, when full investment in 
either the safe or the risky asset locally immunizes the investors' risky fraction from price 
changes of the risky asset. 

The last term, d{Tr)t/d{Y)t, comes into play in models with stochastic opportunity sets, where 
the frictionless target weight vrj is typically no longer constant. Here, the interpretation is 
similar to the boundaries expressed in numbers of risky shares: there is a tradeoff between the 
local fluctuations of the target and the returns of the risky asset. Wildly fluctuating targets 
necessitate a wide buffer to save transaction costs, however, quickly oscillating asset prices 
require close tracking to limit displacement from the frictionless portfolio composition. 

The first and the last term are always positive. The second term — 27rt(l — 7rt)d{TT, Y)t/d{Y)t, 
however, can be either positive or negative, depending both on whether the investors' position 
is leveraged (vr^ > 1) or not, and on the correlation between shocks to returns and the 
frictionless weight. The interpretation is that the no-trade region can be narrowed if the 
fluctuations of the frictionless number of shares is reduced because shocks to returns and the 
target weight partially offset. Conversely, the no-trade region is widened if the directions of 
the two effects tend to agree. Accordingly, the sign of this term switches when passing from 
an unlevered to a levered position, because positive returns then lead to negative shocks to 
the risky weight. 

(iii) In the special case of logarithmic utility, the frictionless optimal portfolio/consumption policy 
can be determined explicitly in general ^50j. The optimal consumption/wealth ratio is then 
completely determined by the investors' time horizon and their impatience rate, whereas the 
corresponding risky weight irt should be held equal to the Merton proportion, i.e., the market's 
(infinitesimal) mean- variance ratio. Put differently, the problem is myopic, i.e., completely 
determined by the local dynamics of the underlying asset price model, and consumption and 
investment separate. Our above results show that both of these findings are robust with 
respect to small frictions: the optimal consumption/ wealth ratio remains the same, and the 
trading boundaries are completely determined by the local dynamics of the model, just like 
the optimal frictionless weight. 

(iv) The above formulas remain valid if the investor only focuses on utility from intermediate 
consumption or from terminal wealth, setting consumption to zero in the second case. 



4.2 Welfare 



To conveniently express the certainty equivalent loss (3.4) due to small transaction costs in terms 
of relative quantities, denote by P the measure with density process <^(/g iTsdYs)t relative to the 



marginal pricing measure QP^J Then, the leading-order certainty equivalent loss due to small 



transaction costs can be described as follows: 



7 



Jo (S'{fQCudu)t 



d{Y)t 



(4.2) 



^^Note that, in the absence of consumption, TVsdYs)t is just the optimal frictionless wealth process starting 
from unit initial capital. 
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Hence, the above P-expectation is the fraction of initial capital the investor would be willing to 
give up to trade the risky asset without transaction costs. This is the scale- invariant measure for 
the effect of transaction costs computed numerically by Balduzzi and Lynch [3j. 

(i) Without intermediate consumption (q = 0), the above formula for the certainty equivalent 
loss can also be interpreted in term of equivalent safe rates as in [19]. To wit, let Pt,Pt 
be fictitious safe rates, for which a full safe investment yields the same utility as trading 
optimally in the original market without and with transaction costs, respectively: 

U{x) = U2{xeP^^) resp. U%x) = U2{xePTT). 

In view of the homotheticity U{x) = x^^'^U{l) of the indirect utility inherited from the 
isoelastic utility U2{x) = x^~^ /{I — 7), this gives 



PT = ^logu^HU{l)) resp. p'^ = ^logu^HU^l)). 
Therefore, also taking into account U2^{x) = ((1 — ^)x)^^^^~'^\ (4.2), and Taylor expanding: 



1 1 

Pt = PT + ^ log T77TT ~ PT + \^ 



7 rT 
2 



(i-7)r ^ u{i) (i-7)r ^ u{i) 

(4.3) 



PT - 



Hence, without intermediate consumption, the certainty equivalent loss per year and unit of 
initial wealth also admits an alternative interpretation as the reduction of the equivalent safe 
rate due to small transaction costs. 

(ii) In the absence of consumption (ct = 0), the impact of small transaction costs is, up to a 
constant, given by the squared-halfwidth (Avrt)^ of the no-trade region, suitably averaged 
with respect to both time and states. Here, time is measured in terms of d{Y)t, i.e., the 
activity of the returns. Averaging across states is performed under the auxiliary measure P 
which, incidentally, also appears in the asymptotic analysis of utility-based prices and hedging 
strategies by Kramkov and Sirbu \35\ I36j. Moreover, it coincides with the myopic probability 
of Guasoni and Robertson [21], under which a hypothetical log-investor chooses the same 
optimal policy as the original isoelastic investor under the physical probability. Accordingly, 
P = P ii the original isoelastic investor has a logarithmic utility with unit risk tolerance. 

(iii) With a nontrivial consumption/ wealth ratio q > 0, the future half-widths (Att^)^ of the 
no-trade region are "discounted" by £'{fQCsds)t = exp( c^ds), a measure of consumption 
accrued until then. This takes into account that intermediate consumption reduces wealth, 
so that the same trading boundaries in terms of fractions of wealth lead to less turnover and 
thereby a smaller welfare effect of transaction costs. 

4.3 Implied Trading Volume Dynamics 

Just like trading policies and welfare, turnover is best expressed in terms of relative quantities 



for power utilities. Then, (3.6) leads to tractable formulas for the measures typically used in the 
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empirical literature |40] . More specifically, relative share turnover (number of shares traded divided 
by number of shares held) is given by 



ShTuT 

Jo m\ 

at the leading order. Similarly, relative wealth turnover (wealth transacted divided by wealth held) 
can be expressed as 



WeTuT 



Xf{ip,K) 

" ''"^^y"'" Q(i - - - + d{Y), 







27 J V d{Y)t d{Y)t 



If the frictionless target vrj is constant, these formulas reduce to the constant rates of Gerhold et 
al. |19]. Generally, they are stationary in that their time averages converge to a long-run mean if 
the transaction costs r/t, the frictionless risky weight ttj, and the market's variance d{Y)t/dt are all 
driven by stationary factors. Moreover, the turnover rate inherits properties such as mean-reversion 
and autocorrelation from the model's state variables, which allows to recapture corresponding 
findings from the empirical literature [4£)]. 



5 Mean- Variance Portfolio Selection 

Dating back to the seminal work of Markowitz |35], Tobin |59j and Merton |51j . mean-variance 
portfolio selection has been arguably the most widely used approach to portfolio optimization, 
both in theory and practice. Here, a portfolio is called mean-variance optimal or efficient if it 
minimizes the variance for a given mean or, equivalently, maximizes the mean for a given variance 
bound. Fix an initial endowment x and a time horizon T > 0. Then, as recapitulated in Appendix [G] 
(also cf. [53l |55l [631 El)) the optimal portfolio is obtained from the one for the quadratic utility 
function u{x) = —x^ as follows. Let C/(— 1) and (\)t be the maximal expected quadratic utility and 
the corresponding optimal portfolio for the standardized initial endowment —1. To achieve a target 
mean E\Xt^{}p')\ = m > x with minimal variance, trade the (m — x)/{l + C/(— l))-fold of (pt and 
keep the remaining initial endowment x + (m — x)/(l + [/(—!)) invested in the safe asset. The 
corresponding minimal variance is given by 

Var[Xr((/?)] = (m - x)^ "^("1) 



1 + [/(-!) 



In particular, this leads to the well-known result that the Sharpe ratio is the same for any mean- 
variance efficient portfolio, irrespective of the target mean m > x: 



^ E[Xt{^)] -X ^ / 

VVar[XT(v^)] V U{-1) 

This also determines the maximal return for a given variance bound as 

E[XT{ip) -x] = sSR. 
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Let us now discuss how these results adapt to the presence of small transaction costsj^ As 
derived in Appendix [Gj the optimal portfolio is again obtained by rescaling its counterpart for 
quadratic utility. To wit, denoting the latter by 0f and writing U^{—1) for the corresponding 
maximal expected quadratic utility for initial endowment —1, the mean- variance efficient strategy 
ipf with target return E[Xj,{ip)] = m > x is given by the (m — 2;)/(l + C/^(— l))-fold of (j)f, where the 
remaining initial endowment x+{m—x) / (—1)) is again held in the safe asset. By applying the 
general results of Section [s] to (truncated) quadratic utility (see Appendix [G] for more details), this 
shows that the mean- variance optimal strategy for a given target mean m> x and small transaction 
costs £t is to keep the number of risky shares in the no-trade region [NT — ANT, NT + ANT] , with 
half-width ANTt and midpoint NTj obtained from their counterparts ANTf and NT^ for the 
optimal portfolio with quadratic utility and initial endowment —1 by rescaling: 



ANTi = ANTf, 



where ANT? 



'3£t(l- j^ct^sdSs) d{^)t 
2 d{S)t 



1/3 



and 



NT. 



m — X 



-NT[, where NT^ 



l + U^{-l) ' Xt{ 

Compared to the frictionless case, the multiplier is increased from (m — x)/(l + U{—1)) to 



m — X 



m — X 



1 + U'{-1) 1 + U{-1) 



1 + 2 



-Ui-1) 



1 + [/(-! 



(ANTf)2 



t 

2(1 - J^Cf^s^^s) 



d{S)t 



where Q denotes the variance-optimal martingale measure with density dQ/dP = X2^((/))/C/(— 1). 
Hence, larger frictions require bigger multipliers to achieve the same target return. The correspond- 
ing minimal variance required to obtain the same return E[Xj,{ip'^)] = m > x with transaction costs 
is increased to 



Var[X|.(v?^)] = (m - xf 



l + C/^(-l) 



Var[XT(v?)] 1 + 



l + C/(-l) 



eQ 



(ANTf 



2(1 - /q (l)sdS, 



-d{S)t 



Conversely, small transaction costs reduce the maximal expected return for a given variance bound 
from sSR to 



\ SR 



T 



(ANTf)2 



2(l-/o^(/-,d5,) 



d{S)t 



The portfolio adjustments as well as the increases in variance resp. decreases in returns are bigger 
for more ambitious target returns resp. looser variance bounds. Nevertheless, the maximal Sharpe 
ratios remain universal among mean-variance efficient portfolios also with small transaction costs: 



SR^ ~ SR 



1 + SR^ 
SR 



eQ 



T 



(ANTf 



2(1 - /g (psdSs 



-d{S)t 



In summary, mean-variance optimal strategies are obtained by rescaling the optimal portfolio 
for (truncated) quadratic utility like in the absence of frictions. To obtain the same target return 



'^In the Black-Scholes model, transaction costs of arbitrary size have been considered by Dai, Xu, and Zhou |12] 
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(resp. variance bound) even with transaction costs, the multipHer and the corresponding variance 
have to be increased (resp. the correponding return has to be decreased). Even though this leads 
to wider no-trade regions for higher target returns (resp. larger variance bounds), the Sharpe ratio 
is still the same for all mean-variance efficient portfolios: it is simply reduced by a constant to 
account for the presence of a non-trivial bid-ask spread. The reduction of the optimal Sharpe ratio 
due to small transaction costs can again be decomposed into direct trading costs, and displacement 
from the frictionless target. Here, transaction costs directly contribute at least two thirds of the 
Sharpe ratio loss, whereas at most one third is caused by displacement. 



6 The Growth-Optimal PortfoUo 

Suppose the investor does not receive a random endowment stream (^'^ = 0). Then, as already 



pointed out in Section 4.1 above, the optimal trading boundaries (4.1) for logarithmic utility do 
not depend on the time horizon, as they are completely determined by the local dynamics of the 
asset prices and the myopic frictionless optimizer. Without intermediate consumption, this policy 
therefore maximizes logarithmic utility E[logXj.{ilj^)] on any horizon T > 0, and thereby also 
the expected growth rate of wealth limsuprp^^ ^S[logX|,(V''^)], both at the leading order for small 
costs. 

As in the frictionless case \29\ Theorem 3.10.1], this property can be strengthened in an almost 
sure sense. Indeed, the log-optimal portfolio (pf maximizes the almost sure long-term growth rate, 
at the leading order for small costs: 

limsup;^logX|.(V'^) < limsup ^logX^i^^) + o{e'^/^), (6.1) 

for all competitors ip'^ . In the frictionless case, this property has attracted the attention of vari- 
ous authors dating back to Kelly [32j, Latane [38], Markowitz |46j . and Breiman [6]. The trading 



boundaries (4.1) provide the corresponding leading-order correction for small proportional trans- 
action costs, characterized explicitly in terms of the local asset dynamics as in the fricitionless 
case. The impact of a small spread on the maximal asymptotic growth rate can also be quantified 
explicitly. At the leading order, it is given by 

Inp- Xrp(in\ — lim — Inp- X^dn'^^ = lim — / 

T^oo T 



lim 1 log XT{ip) - lim 1 logX^((^^) = lim ^ f (^d(y),, (6.2 



if the limits exist. Hence, the reduction of the long-term growth rate due to small transaction 
costs is given by the long-run average squared halfwidth of the no-trade region, measured in risky 
fractions and computed in business time. For concrete models with stationary state variables, this 
readily yields explicit formulas by integrating against the corresponding invariant measures. 



7 Conclusion 

This paper provides simple asymptotic formulas for optimal investment and consumption policies 
as well as the corresponding welfare and implied trading volume, in the practically relevant limiting 
regime of small bid-ask spreads. 

We find that investors should keep their positions in a time- and state-dependent no-trade 
region, which is myopic in that it is fully determined by the current spread, the investors' frictional 
wealth, and quantities inferred from the frictionless version of the problem. The corresponding 
welfare effect of small transaction costs is determined by the squared halfwidths of future no-trade 
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regions, suitably averaged across time and states. Here, the investors' strategy enters through the 
activity rate d{ip)t/ d{S)t - the portfoho's gamma in complete markets - which therefore quantifies 
the exposure to liquidity risk through changes in the spread. At the optimum, two thirds of the 
utility loss are incurred due to actual trading costs, whereas the remaining one third is caused by 
displacement from the frictionless target. Optimal implementation of frictionless strategies in the 
presence of a small bid-ask spread leads to a turnover rate determined by the local fluctuations of 
both the target strategy and the target; the spread only enters though the cubic root of its current 
width. 

All of these results are surprisingly robust: they apply for general diffusive asset price and 
cost dynamics, arbitrary preferences over intermediate consumption and terminal wealth, and also 
in the presence of random endowment streams. Moreover, they can be extended to cover other 
widely-used optimization criteria such as mean-variance portfolio selection and the maximization 
of the long-term growth rate. 



Appendix: Derivation of the Main Results 

In the sequel, we derive the results presented in the previous sections. Throughout, mathematical 
formalism is treated liberally. Full verification theorems have been worked out by Soner and 
Touzi [57J as well as Possamai, Soner and Touzi [52] for infinite-horizon consumption problems in 
complete Markovian markets. 



A Notation 

Throughout, we write (j) ' St for the stochastic integral (figdSg and denote the identity process 
by It = t. For an Ito process Xt, we write 6^^ for its drift rate, resp. hf'^ if the latter is computed 
under another measure Q. Moreover, we denote by = d{X)t/dt its local quadratic variation, 
and by c^^ = d{X, Y)t/dt its local covariation with another Ito process Yt. 



B The Risk-Tolerance Process 

We begin with an analysis of the investors' indirect risk tolerance. This object describes how 
the frictionless investors' attitude towards future risks changes with their wealth level. Thereby, 
it determines how the wealth effect of a small friction is reflected in the optimal policy and the 
corresponding utility. As a result, we expect it to play a pivotal role not only for the proportional 
transaction costs considered here but also in the asymptotic analysis of other market imperfections. 

To define the investors' indirect risk tolerance, denote by Xt = Xt{ip, n) the optimal wealth pro- 
cess of the frictionless utility maximization problem, generated by the optimal portfolio/consumption 
pair [(ft^Kt). Define the investors' indirect utility as 



U{t, x) = sup Et 



ui{s,ks)ds + U2 (^x + ipsdSg-J^ ksds + '^t^ 



(B.l) 



where the supremum is taken over all portfolio/consumption pairs on [t,T]. Then, we call the 
risk tolerance Rt = —U'{t, Xt)/U"{t, Xt) of the indirect utility, evaluated along the optimal wealth 
process, indirect risk tolerance. In this section, we investigate the properties of this object. First, 
we describe its local dynamics in terms of a quadratic backward stochastic differential equation 
(henceforth BSDE). Then, we discuss how it can be represented as a suitable expectation of the 
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terminal risk tolerance —U2{Xt)/u2{Xt) and the intermediate risk tolerances —Ui{t,K,t)/u'({t,Kt) 
from consumption, and describe how it generalizes the risk-tolerance wealth process of Kramkov 
and Sirbu [35j. 



B.l A Dynamic Characterization of the Indirect Risk Tolerance 

Our starting point is the dynamic programming principle, which states that, for any infinitesimal 
interval dt: 



U(t-dt,x)= sup lui{t - dt,kt-dt)dt + Et-dt[Uit,x + iptdSt - h-dtdt)] 

{ipt,kt-dt) 

=■ sup f{t- dt,x,'ilJt,kt-dt) =■■ f{t- dt,x,ift{x),Kt^dt{x)). 

{ipt,kt-dt) 

Here, ipt{x) and Kt-dt{x) denote the (predictable) optimal number of risky shares and the (adapted) 
optimal consumption given wealth x at time t — dt. Evaluated along the optimal wealth process, 
these coincide with the globally optimal portfolio/consumption pair iftjKt-dt- The optimality of 
iftix), nt^dt{x) implies that the respective partial derivatives vanish: 

f^{t- dt,x,ipt{x),Kt-.dt{x)) = f^,{t - dt,x,ipt{x),Kt^dt{x)) = 0. (B.2) 

Moreover, we have 

fifiKit - dt,x,ipt{x),Kt-dt{x)) = Et^dt[U"{t,x + ift{x)dSt - Kt~dtdt)dStdt\ = 0. (B.3) 

In the following calculations, we suppress the arguments of the functions to ease notation. In view 

of (ig: 

U' = U + U^p'tix) + UK^^dti^) = U. (B.4) 



As a result: 



Moreover, (B.2) and (B.3) also yield 
and therefore 

4ix) = -^, <-dti-) = -^. (B.5) 
U" = + Uif'tix) + /.«k;_^,(x) = - - (B.6) 

Jipip Jkk 

With these preparations, we now argue by formal recursion that the indirect marginal utility 
U'{t, Xt) evaluated along the optimal wealth process coincides with the dual martingale density Zt, 
which is - up to normalization ~ the density process of the marginal pricing measure Q. This is 
evidently true at the terminal time T, where U'{T,Xt) = u'2{Xt) = Zj- reduces to the well-known 
first-order condition linking the solutions of the primal and dual problems (see, e.g., [291 Section 
3.6]). Now, suppose that it is already known that 

U'{t,Xt) = Zt. (B.7) 
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Then, (B.4) and the definition of / give 



U'{t - dt,x) = fx{t - dt,x,ipt{x),Kt-dt{x)) = Et-dt[U'{t,x + (pt{x)dSt - Kt-.dt{x)dt)]. 



Evaluated at the optimal wealth Xt^dt and using (B.7), this shows 

U'{t-dt,Xt-dt) = Et-dt[Zt] = Z, 



t-dt- 



(Bi 



Here, the last equality follows from the martingale property of Zt, completing the recursion. Also 
note that, together with ( B.2[ ), this gives 

= fnit - dt, Xt-dt, ft, Kt-dt) = u'i{t - dt, Kt-dt)dt + Et^dt[-Ztdt] = u[{t - dt, K,t-dt)dt - Zt-dtdt, 
and thereby 



Zt-dt = u'lit - dt, nt-dt)- 



(B.9) 



Next , we turn to the indirect risk tolerance Rt = —U'{t,Xt)/ U" {t,Xt), and describe its dynamics 
by means of a BSDE. To achieve this, it is easier to start from the indirect risk aversion R^^ = 
—U"{t,Xt)/U'{t,Xt) = —U"{t,Xt)/Zt. In view of (B.6), we first compute the various partial 
derivatives of /. By definition of /, we have 

fxx{t - dt,x,ipt{x),Kt-dt{x)) 

= Et-dt[U"{t,x + ipt{x)dSt - Kt-.dt{x)dt)] 



E, 



t-dt 



jj,,. , / N ,c. / \^+\^ {t,x + Lpt{x)dSt- Kt-dt{x)dt) 

U {t,x + ft{x)dSt - Kt-dt{x)dt)--j- — - — ^— — — 

U'(t, X + Lpt{x)dbt - Kt-dt{x)dt) 



Evaluating at Xt-dt, inserting the definition of R^ ^ as well as (B.7), and using the generalized 
Bayes' rule, it follows that 



fxx{t — dt, Xt-dt, ft, Kt-dt) — —Zt-dtE^_^^[Rt ^] — - Zt-dtE^_^^[R^}^^ + dRf^ ^] 

= -Zt-dt{Rtldt + bf"''^dt). 



(B.IO) 



Next, 



fx^{t - dt,x,Lpt{x), Kt~dt{x)) = Et-dt[U"{t,x + Lpt{x)dSt - Kt-dt{x)dt)dSt]. 
Similarly as above, evaluation at Xt-dt gives 

fxip{t — dt, Xt-dt, ft, K't-dt) = ~ Zt-dtRtldt^?-dt\'^^t\ ~ ^t-dtEf-dt\dRt ^dSt 

= —Zt-dtcf ^dt, 



(B.ll) 



where we have used the martingale property of St under the marginal pricing measure Q for the 
second step. Likewise, 

f^^{t - dt,x,ftix), Kt-dt{x)) = Et-dt[U"{t,x + ft{x)dSt - Kt-dt{x)dt){dStf], 

so that evaluation at Xt-dt yields 

f^^{t - dt, Xt-dt, ft, Kt-dt) = -Zt-dtEf_^^[Rt^{dStf] 

= -Zt-dt{R;ldtE?-dt[idSt?]) - EljdR^HdStf] 

= -Zt-dtRtldt^ldt. (B.12) 
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Next, 

fxK{t - dt,x,(pt{x),Kt-dtix)) = Et^dt[U"{t,x + ipt{x)dSt - Kt-dt{x)dt){-dt)], 
and in turn 

fxnit — dt, Xt-dt, Vt, i^t-dt) = Zt-dtE^_dt[Rt ^dt] = Zt-dtRt-^dt^^ + ^t-dtE?-dMRt ^^*] 

= Zt-dtR;^dtdt- (B.13) 

Finally: 

- dt, X, (pt{x), Kt-dtix)) = u'lit - dt, Kt--dt)dt + Et-dt[U"{t, x + (ptix)dSt - Kt^dt{x)dt){dtf]. 

Hence, arguing as above and using the definition of rt-dt - 
well as ( |R9l ): 

- dt,Xt^dt, 'Pt, Kt-dt) = -Zt^dtrtldtd^ ~ Zt-dtE'^^^^[Rt'^{dtf] = -Zt-dtrtldtd^- 
Inserting ( |B.10 B.14) into ( |B.6 ) then gives 



u'^{t - dt, Kt-dt)/u'{{t - dt, Kt-dt) as 

(B.14) 



U"{t - dt, Xt-dt) = -Zt-dtRtldt - Zt-dt \ hf'''^ - Rt- 



dt- 



(q 



-Zt-dtRt 



t-dt 



Zt- 



dt 0^ 



r.S 





ft-dt 




^t-dt 


n 


j dt. 


Rl 



dt 



Since —Zt-dtRtldt ~ ^"{^ ~ dt, Xt-dt) by definition of the indirect risk tolerance and (B.8), it 
follows that 



Ri 



+ 



Rl 



Now, taking into account that Ito's formula yields dR^ ^ = — i?^ ^dRt + R^ ^d{R)t and in turn 
indirect risk tolerance Rt'. 



as well as c? 



"t 



-Rl '^cf'^ , we obtain the following BSDE for the 



lR,Q 



1 
lit 



(q 



n, 



Rt 



A{Xt) 

u'iiXTY 



(B.15) 



where the terminal condition follows directly from the definition of Rt- The special cases of utility 
only from terminal wealth {ui{t,x) = 0) or only from intermediate consumption (u2(x) = 0) can 
be dealt with as above, setting either rt, t < T , ox the terminal value Rt equal to zero. 



Remark B.l. Equation (B.15) is a quadratic BSDE for Rt. Indeed, suppose the filtration is 
generated by a d- dimensional Q-Brownian motion and dSt = atdW^ for an W^-valued volatility 
process at- Then, (B.15) takes the form 



dRt 



CjQt {aJCt? 
RtcrJat 



rt]dt + CtdW^ 



Rt 



u'^{Xt) 



Rt Rtajat - ■ ---^ ' 

In the above computations, we have also determined the sensitivity of the optimal portfo- 
lio/consumption pair {ipt,Kt) with respect to chang es in wealth . Indeed, evaluating (B.5) along 



the optimal wealth process and taking into account (B.11-B.14) as well as c- 
obtain 



R-^S 



"P't 



MS 



Rtcf 



rt_ 

Rt 



R^cf-^, we 



(B.16) 
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B.2 Expected Risk Tolerances and Risk- Tolerance Wealth Processes 



Next, we discuss how the indirect risk tolerance can be represented as the expectation of risk 
tolerances with respect to future consumption and terminal wealth, computed under a suitable 
equivalent martingale measure. 



Risk- Tolerance Wealth Processes 

ance 



First, consider the special case where the total risk toler- 



rfdt + Rt 



u[{t,nt] 



dt 



T 



,0 JO u'{{t,Kt) 4'(Xt) 

can be replicated by dynamic trading in the frictionless market. In the absence of consumption, 
this means that a risk-tolerance wealth process in the sense of Kramkov and Sirbu [35] exists. 

If the total risk tolerance can be replicated, Jq rtdt + Rx = Ro + Jq iptdSt^ then our indirect risk 
tolerance can be represented as the expectation of future risk tolerances with respect to consumption 
and terminal wealth, computed under under the marginal pricing measure Q: 



Rt = E'r 



rgds + Rt 



-ds 



T) 



li-j^ (s. Kg 



(B.17) 



Indeed, the Q-martingale property of St and in turn Jq ipsdSg allows to rewrite (B.17) as Jq r^ds + 
Rt = Ro + f^tl^sdSs, so that Rt has Q-drift b^'^ = —rt. Moreover, this representation also implies 
cf — {cf-^)"^ /ct = V't cf — "0? cf = 0, thereby showing that the right-hand side of (B.17) indeed 
satisfies the BSDE (B.15) for Rt- Without intermediate consumption {rt = 0), this also shows that 
the indirect risk tolerance is given by the risk-tolerance wealth process E^[Rt] + f^tl^sdSs, if the 
latter exists. The replicability of the total risk tolerance is guaranteed in the important special 
cases of complete markets and standard utility functions of exponential or power type: 

(i) Complete Markets: here, any payoff can be replicated, so in particular this holds for the total 



risk tolerance. Then, (B.17) is computed under the unique equivalent martingale measure for 
the market at hand. 

(ii) Exponential Utilities: suppose the utilities from terminal wealth and intermediate consump- 
tion all have constant absolute risk tolerances: ^2(2;) = — e~^^^ and ui{t,x) = — /3e^^"^~*^e~^^^ 
for risk-tolerances l/pi,l/p2 > 0, some impatience rate 5 > 0, and a scaling factor /3 > 0. 
Then, the total risk tolerance is constant and therefore evidently replicated by V't = 0. As a 
result, the indirect risk tolerance is deterministic and given by Rt = l/p2 + ~ ^)l{/3>o}/Pi- 
In particular, it coincides with the investors' constant risk-tolerance in the absence of con- 
sumption. The sensitivities in (B.16) read as y?^ = and = l{^>o}/(^ ~ * + Pi/P2)- 



(iii) 



Power Utilities: next, consider utilities with constant relative risk tolerance, in the absence 
of random endowment {^t = 0). First, suppose there is no intermediate consumption (rt = 
0) and utility from terminal wealth has constant relative risk tolerance I/7 > 0, that is, 
U2{x) = x^~'^/{l — 7). In this case, the total risk tolerance is given by the l/7-fold of the 
optimal wealth process, Rt = Xt/^, and is therefore evidently replicated by the l/7-fold 
of the optimal trading strategy ipt- With intermediate consumption this remains valid, if 
the utilities ui{t,x) also have the same constant relative risk tolerance I/7, e.g., if they are 
of the widely used form ui{t,x) = /3e^^"^~*^x^~'''/(l — 7) for some impatience rate 5 > 
and scaling factor /3 > 0. Then, the risk tolerance with respect to consumption is given 
by the l/7-fold of the latter: rt = Kt/7. As a result, Rt = Xt/j for the wealth process 



20 



Xt = X + f^ipsdSs — Jq Ksds generated by the optimal portfolio/consumption pair {(pt,Kt). 
Indeed, Rt = Xt/^ has Q-drift —{Kt/j)dt = —rtdt, because the risky asset is a Q-martingale. 

— \j, iu thereby solves the BSDE (|B.15|). The sensitivities 
(ft/Xt and k[ 



As it also satisfies cP' 



in (B.16) are given by ip[ 



0, it thereby solves the BSDE ( |B.15[ ) 

Kt/Xt. 



Expected Risk- Tolerance Beyond the Replicable Case If the total risk tolerance can be 



replicated by dynamic trading, (B.17) shows that the indirect risk tolerance can be interpreted as the 



expectation of risk tolerances with respect to future consumption and terminal wealth, computed 
under the investors' marginal pricing measure. Generally, this result is only applicable if the market 
is complete or the investors' preferences are described by a standard utility function. Nevertheless, 
the interpretation of the indirect risk tolerance as an expectation of future risk tolerances with 
respect to consumption and terminal wealth can be transferred to the general case, as we now 
argue. To this end, decompose the risk tolerance-process Rt as follows: 



dRt 



b^'^^dt + ^ 



RS 



-dSt + dR 



t ■ 



(B.18) 



Then, Rf- is a Q-martingale orthogonal to St, because d{R^,S)t = cf^dt 



-cfdt 



0. Hence, 



(B.18) is - up to the drift term - the Galtchouk-Kunita-Watanabe decomposition of the indirect 



risk tolerance. Now, consider the measure 

Q ~ Q, with density process Zt = S{R~^ • R^)t- 

This defines an equivalent martingale measure: since St and Zt are orthogonal Q-martingales, 
integration by parts shows that ZtSt is a Q-martingale and St is in turn a Q-martingale. 

In the general case, the equivalent martingale measure Q replaces the marginal pricing measure 
Q in determining the indirect risk tolerance as an expectation of future risk tolerances with respect 
to consumption and terminal wealth: 



Rt = E": 



T 



Tgds + Rt 



(B.19) 



To see why (B.19) holds true, verify that the right-hand-side of (B.19) satisfies the BSDE (B.15) 



By (B.19) and Girsanov' theorem, the drift of Rt under the measure Q with density process Zt is 

(MS\2^ 

I \(^t ) 



given by 



-rtdt = b^'^dt = bf'^dt - {R, R'^ • R^)t = bf'^dt 



Rt 



dt. 



As a consequence, the specification (B.19) indeed solves (B.15) and therefore provides an interpre- 



tation of the risk tolerance- process as the expectation of future risk tolerances under Q. 



C Martingale Optimality Conditions 

In this section, we derive conditions that ensure the approximate optimality of a family of candidate 
policies as the spread £t = eSt becomes small. To ease the exposition, we first briefiy recall their 
exact counterparts for the frictionless case (e = 0). 
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C.l Prictionless Optimality Conditions 



Without transaction costs (e = 0), a portfolio/consumption pair {ipt,Kt) is optimal if (and essen- 
tially only if |29t I3U|) there exists a process Zt satisfying the following optimality conditions: 

i) Zt is a martingale. 

ii) Zt is a martingale density, i.e., ZfSt is a martingale. 

iii) Zt = u'i{t, Kt) for < t < T, and Zt = U2{Xt{^, k))- 

The first two conditions imply that Zt is - up to normalization - the density of an equivalent 
martingale measure Q. The third links it to the optimal consumption stream and terminal payoff 
by the usual first-order conditions. 

Let us briefly recall why Conditions i)-iii) imply the optimality of {ipt,i^t)- For any compet- 
ing portfolio/consumption pair {tpt,kt), the concavity of the utility functions •) and U2{-), 
Condition iii), and Fubini's theorem imply 

E [Ui{; k)'lT + U2(Xt(V, m - E [ni(-, k)'It + U2{Xt{^, k))] 

< E [<(•, K){k -k)'It + u^iXriv, K)){XT{iJ, k) - Xt{^, k))] 
= E [Z{k -k)'It + Zt{Xt{iIj, k) - Xt{v, k))] 
= ZqE"^ [{k -k)'It + (Xr(^, k) - Xt{v, k))] 
= ZoE<^ [{^ - v9) • St] 
= 0, 

where the last equality follows from the Q-martingale property of {ip—(p) • St, which is a consequence 
of Conditions i) and ii). This shows that {ipt,Kt) is indeed optimal. 



C.2 Approximate Optimality Conditions with Transaction Costs 

Let us now derive approximate versions of the optimality conditions i)-iii), ensuring the approximate 
optimality of a family (tpf, Kt)e>o of portfolio consumption pairs, at the leading order 0(e^/^) as 
the spread et = e£t tends to zero Since the frictional optimizers converge to their frictionless 
counterparts {ipt, Kt), it suffices to consider strategies and consumption rates that coincide with ipt 
resp. Kt up to terms of order o(l). 

Now, let ((/?f,Kf)e>o be a family of portfolio/consumption pairs whose optimality we want to 
verify. As in the frictionless case above, for any family of competitors {ipf, kt)e>Oj the concavity of 
ui(t, •) and U2{-) implies 

E [u,i; ¥) ' It + U2{XHr, k'))] - E [ui{; k') ' It + n2(X|^(<^^ k'))] 

<i^K(•,K^)(A:^-«^)•/T + n'2(x^((/p^K^))(xf,(^^A:^)-x^((^^K^))], ^ • ^ 

where Xfitp^ , k'^) and X^y?^, k^) are the wealth processes generated by the portfolio/consumption 
pairs [tptikt) resp. {(pt,Kt) with transaction costs et = eSt- Now, suppose we can find frictionless 
shadow prices Sf evolving in the bid-ask spreads [St — et, St + et], which match the trading prices 
St ± et in the original market with transaction costs whenever the respective strategy ipf trades. 
Then, the frictional wealth associated to {(pf, Kf ) evidently coincides with its frictionless counterpart 
for St, i.e., k^) = x + (p^ * S^ — • It + ^t- For any other policy {tpt, kt), trading in terms 



^Rogers [54] provides a simple probabilistic argument why this is the relevant order. 
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of Sf rather than with the original bid-ask spread can only increase wealth, since trades are carried 
out at potentially more favorable prices: Xj^{'tp^, k^) < x + tp'^ • Sj^ — ' It + ^t- Together with 
(C.l ), this implies: 

E [ui{; k') • It + n2{X^{r, k'))] - E [ui{; k^) • It + U2{X^{y^', k'))] 

< E [<(., - '^') • It + n'^iX^if', k')) ((V' - v') ' S't " (k' - ' h)] ■ ^ ' ^ 

Now, suppose we can find a process Zf satisfying the following approximate versions of i)-iii) above: 

i^) is approximately a martingale, up to a drift b^^ • It of order 0(e^/^). 

ii^) The martingale part Mf^ of Z^ is an approximate martingale density, in that the drift rate 
of MfSI is of order 0(e2/3). 

iii=) Zl = u[{t, Kf) + 0(e2/3) for < t < T, and = u'^{X^{^^ , k^)) + ©(e^/S). 



Then, since kf — = o(l) and — ipf = o(l). Condition iii^) implies that (C.2) can be written as 

E [ni(., k') . It + U2iX^ii^' , k'))] - E [ni(-, k') • h + n2(X|.((^^ k'))] 

< E [Z'{¥ - k') 'It + Z't ((V" - 93") • S't - [k' - k') • It)] + o{e'^l'^). ^ ' ' 

Let be the measure with density process given by the martingale part M^^ of Zf. Then Fubini's 
theorem yields 

E[Z'{k' - k') • It] = Z^E^'[{k' - k') • It] + E[{{b^' • I){k' - k')) • h] 

where we have used kf — Kf = o(l) and i"^) for the second step. Likewise, i^) also yields 

E [Z't Hi'' - ^1 • S't - ik' - Ac^) • It)] = Z^E^' [{{i;' - ip^ • 5^ - {k' - k^) • It)] + o{e'/^), 

because 'i/'f — kf — nf are both of order o(l). Combining these two estimates gives 

E [uii, k') • It + n2(X^(V'^ k'))] - E [ui{-, k') • h + u^iXf^i^^ k'))] 
<ZIE'^\{^' -^')'S?r]-ro(e'l^). 

To establish the approximate optimality of the candidate family ((^^,K|)e>o, it therefore remains 
to verify that - given ii"^) - the process (V'^ — y?^) * is an approximate Q'^-martingale, in that the 
drift rate of Mf{('ip'^ — ip^) • Sf) is of order o(e^/^). But this readily follows, applying integration 
by parts twice to obtain 

Mf ((^^ - p^) . SI) = - ^') • S') • Mf + M^\^^ - ^^) . S! + - V') • {M^\S')t 
= - ^') ' - - V>')S') • Mf + - V>') ' {M^^S')t, 

and taking into account that the drift rate of the stochastic integral with respect to the martingale 
Mf^ vanishes, whereas the drift rate of the stochastic integral with respect to MfSf is of order 
o(e^/^) because of ii"^) and since V'f — <^t is of order o(l). 

Summing up, a family of policies {ipf, /tf ) is indeed approximately optimal if we can find shadow 
prices Sf and approximate martingale densities Zf satisfying the approximate optimality conditions 
i^)-iii'^) above. 
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D Approximate Optimality of the Candidate Policy 



We now verify that the pohcy proposed in Section 3.1 satisfies the approximate optimaUty conditions 



i^)-iii^) and is therefore indeed optimal at the leading order 0(e^/^) for small transaction costs 
et = eSt- In a first step, we construct a candidate shadow price. Subsequently, we put forward a 
corresponding martingale density, which satisfies i^) and iii^). In a third step, we then show that 
this martingale density and the candidate shadow price also satisfy ii^). 

Step 1 : define the direct risk tolerance rt from current consumption and the indirect risk 
tolerance Rt as in Appendix[B| and let ^pl correspond to the minimal amount of trading necessary to 
keep the number of risky shares in the random and time- varying no-trade region [NT^ — ANT^, NTt + 



ANT^]. By definitions (3.2), (3.3), and (B.16), its midpoint and halfwidth are given by 



^RS 



NTi = ^t + ^'t{Xt{^', K^) - M^p, k)) = ipt + - Xtip, k)) 

Etc? 



and 

respectively. This no-trade region is not symmetric around the frictionless optimizer ipt. To nev- 
ertheless construct a shadow price similarly as in the symmetric exponential case ^j, decompose 
the deviation Acpt = (pf — pt of the frictional position iff from its frictionless counterpart into two 
parts: Aipt = '^ft + ^'^t- Here, the first term Ap^ = NT^ — ipt measures the deviation of the 
midpoint NTt of the no-trade region from the frictionless target position ipf The second term 
A(p^ = iff — NTf in turn describes the deviation of the frictional position pf from the midpoint 
NTj. Note that this is an Ito process, reflected to remain in the random and time- varying but 
symmetric interval [— ANT^, ANT^]. With this notation, define 
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Sf = St + ASt = St + atAift - jt^ipt 



for 



1 c? / 9 c 



1/3 



One readily verifies that the process Sf takes values in the bid-ask spread [St—et, St+et]. Moreover, 
since at (±ANTt)3-7t(±ANTt) = ^et, it coincides with the bid resp. ask price whenever the policy 
Pt prescribes the purchase resp. sale of risky shares after reaching the boundaries of the no-trade 
region for Acpt = itANTt. Consequently, 5f is a valid candidate shadow price process. 

Next, note that 3a;t(ANTf)^ — 7^ = 0. As a result, integration by parts and Ito's formula give 

ASt - AS'o = a • Apl + Ap^ • at + (a, Ap)t - 7 • Ap^ - Ap • -/t - {j, ^^)t 
= (3aA^^ - 7) • A^t + (3aA^) • (Ap)t 

+ Ap' • at - Ap • 'jt + (3A(/j^) • (a, Ap)t - (7, Ap)t 
= -{3aAp^ - 7) • (99 + Ap)t + (3aA^) • {p + Ap)t 

+ Ap^ ' at- Ap' -ft - (3A99^) • (a, v? + Ap)t (7, <y9 + Ap)t. (D.l) 



^^Note that these definitions match the ones for the case of exponential utiHty [27], where the indirect risk tolerance 
Rt is constant, and the frictionless optimizer coincides with the midpoint of the no-trade region. 
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Here, we have used for the last equahty that (pi = ipt + iS.(p^ + Ac^^ only moves on the set /S.ip^ = 
zbANTi where SatAc/?^ — 74 = 0, and that A(/9^ = ip^ — ipt — Aip^. only differs from —{pt + ^Vt) by a 
finite variation term. Given that the risky asset 5^, the frictionless optimizer (ipt, nt), the indirect 
risk tolerance Rt, the transaction cost process £t = eSt, and their local quadratic (co-)variations 
all follow sufficiently regular Ito processes, the above representation shows that this property is 
passed on to the coefficient processes at and 74 as well as to ASt- Moreover, A(p^ = 0{e^/^), 
Atp^ = 0(e^/^), 7j = 0(e^/^), and at = 0(1) yield that the drift rate of AS'* is given by 

bt' = ^atA^tct + 0{e^'\ (D.2) 



Step 2: set Ak^ = Kf — Kt and define 

AXt = Aip • St - Ak ' It. 

At the leading order 0(e^/^), this process measures the difference between the frictionless optimal 
wealth process Xt{ip,K) = x + ip*St — K'lt + ^t and its frictional counterpart, which can be 
expressed as X^{ip^, k^) = x + cp"^ • S^ — • It + '^t because Sf matches the trading prices whenever 
ipf moves. Indeed, 

x+p^ • Sf-K^ • It-{x+ip • St-K • It) = Aip • St+ip^ • ASt-AK ' It = Aip ' St-An • lt+0{e^'^), 

because integration by parts gives 

ip' • ASt = Ap . ASt + VtASt - poASo -AS'pt- AS)t = 0{e^/^) 

as Ap = 0{e^^^) and the drift and diffusion coefficients of ASt are of order 0(ei/3) and ©(e^/S), 
respectively, by (D.l). With this notation, define 

Zf = (1 - AXt/Rt)Zt, 

where Zt denotes the density process of the frictionless marginal pricing measure Q up to normal- 
ization ^'^ Then, Z^ satisfies the approximate optimality conditions i^) and iii^). Indeed, Taylor 
expansion, the frictionless optimality condition iii), as well as 

At,t = 4-^t = ^AXt + 0{e'/') and n - '''i^*'^*^ 



Rt ' ' ' n'lit,Kt)' 
give 

u[{t, 4) = u'^{t, Kt) + AKtu'lit, Kt) + 0(e2/3) = Zt{l + Antu'lit, Kt)/u[{t, Kt)) + 0{e^^^) 

= Zf + 0(e2/3). 

Likewise and also taking into account the terminal condition for Rt, one obtains 

u'^iX^ip', k'')) = u'^{Xt{p>, k)) + AXtu'^{Xt{p, k)) + 0{e^/^) = Zt{1 - AXt/Rt) + 0{e^'^) 

Hence, the process Zf satisfies iii''). Let us now check that is also verifies i^). To this end, first 
recall that Zt is a martingale by the frictionless optimality condition i). Hence, integration by parts 
shows 

Zl ^ -Z ' (AXt/Rt) - {Z, AX/R)t, (D.3) 



''For exponential utilities with constant indirect risk tolerance -Rt, this definition reduces to the one in (27) . 
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where = denotes equality up to a process with of order 0(e^/^) (here, the remainder in fact has 
zero drift). By Ito's formula, and since AXt = A(/3 • St — Ak • It and Ant = -j^AXt + 0(e^/^): 

AXt AX AX , , Aifi ^ An Aip , 

- ^ -i?t + 4 • {R)t -r'It)+^'(st-l,'{R, S)t) , (D.4) 



i?2 V R \ R 



where = again refers to equality up to a process with drift of order 0{e'^^^). Inserting this into ( D.3[ ) 
and writing the density process of the frictionless marginal pricing measure Q as Zt/Zo = S{N)t 
leads to 

Zf - ^ • + (N,R),-\- (R), + '■•'')- ^ • {st + (N,S)t-^- (R. S), 



i?2V R J R \ R J 

Now, recall that the deviation Aipt = Aipf- + A^p^ of the frictional position from its frictionless 
counterpart is composed of two parts. The first, which represents the shift of the midpoint of 
the no-trade region due to the past effects of transaction costs, is an Ito process with drift and 
diffusion coefficients of order 0{e^^^). In contrast, the second term measuring deviations from 
the midpoint of the no-trade region, also involves reflection off the boundaries ztANT^ = 0{e^^^). 
Hence, this process is not only small, but also oscillates quickly around its mean zero. Therefore, 
it can be neglected at the leading order in the time average above. As a result, replacing Aipt by 

Zf ^ — ^ b^'^ - 7^ - ] + r ^b^'^ • It- 



R^ V ^ V J , 

The frictionless price process St is a martingale under the marginal pricing measure Q with density 

S O 

process Zt/Z^ by the frictionless optimality condition ii); hence its Q-drift rate vanishes, b^ = 0. 



Together with the BSDE (B.15) for the indirect risk tolerance Rt, this shows that that drift of Zf 



is indeed of order 0{e^/^) as required by i 

Step 3: to establish the leading-order optimality of the proposed policy {ipf,K,f), it remains to 
verify the approximate optimality condition ii^), i.e., that M^^S^ is approximately a martingale 
for the martingale part of Zf. To see this, first notice that integration by parts and the 

martingale property of M^^ yield 

MfSl ^ M^" • Sf + {M^\S')t 
^ Z' • S't + (Z^ S')t 

= Z{1 - AX/R) ' {St + ASt) + {Z{1 - AX/R), S + AS)t 

^ Z{1 - AX/R) 'St + Z'ASt + {Z{1 - AX/R), S)t 

= Z{1 - AX/R) ' {St + {N, S)t) + Z'ASt-Z' {AX/R, S)t, 

where = once more denotes equality up to a process with drift of order 0{e'^^^). Here, the second 
step uses that only differs from its martingale part Mf by a finite variation drift of order ©(e^/'^) 
as verified above. For the fourth step, we have used that the drift and diffusion coefficients of ASt 
are of order 0{e^^^) resp. 0(e^/^), and that AXt is of order 0(e^/^). Now, inserting representation 
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(D.4) for /S.Xt/ Rt and using that St + {N, S)tis a. martingale by the frictionless optimahty condition 



ii) and Girsanov's theorem, it fohows that 

AX 



urSf ^Z' [ASt 



^RS 



z 

0, 



,A5 



i?2 



{R,s)t 
'It 



{S), 



R 



where we have again replaced A(pt in the time average by Aip^ 



step, whereas the last step is a consequence of (D.2). In summary, the approximate optimality 



ISj AXt + 0(e2/3) in the second 



conditions i^)-iii'^) are satisfied, so that the policy {(f ^, 
optimal. 



^ nf) from Section 3.1 is indeed approximately 



E Computation of the Utility Loss 



By definition of Aki 



-Kf, the consumption adjustment due to transaction costs can be written 



as Aki = ^AXt + 0(e2/^), where AXt = A(f * St — An • It as above. Then, Taylor expanding 
•), using Fubini's theorem, and inserting the frictionless optimality condition iii) as well as the 
definition of r^, it follows that 

E[ui{-, k') 'It]-E [ui{-, k) • It] = ( E[u[{-, k)Ak] + l-E[u'l{-, k)Ak^]] • It + 0(e) 



Zq (e^[Ah'It] - -E^ 



AX' 



It 



+ 0{e) 



for the frictionless marginal pricing measure Q with density process given by Zi/Zq. Similarly, for 
the utility from terminal wealth, expand U2{-) and use the frictionless optimality condition iii) as 
well as (99^, K^) — Xt{ip, k) = AXt + 0(e^^'^) and the terminal condition for Rt- 



E[U2{XU^', k'))] - E[u2{XT[ip, k))] = E[u'2[Xt{^, t^)){XHv', k 



Xt{^,k))] 

XT{v,K)f] 

ZoE^[Aip • ST + {ip + Aip) ' ASt - Ak • It] 



+ -E[u'i{XT{ip,K){XH^',K' 



0(e) 



- -ZoE"^ 
2 " 



Rt 



+ 0{e). 



Since the risky asset St is a Q-martingale by the frictionless optimality condition ii), the Q- 
expectation of Aip • St vanishes. Moreover, by, e.g., |30l Theorem 3.10.(v)] the initial value Zo 
of the dual martingale density coincides with the derivative U'{x) of the frictionless value function 



with respect to wealth (see also (B.7)). Combing this with the two estimates above, the differ- 



ence between the utilities obtained by applying the approximately optimal policy ((^ 
transaction costs and the frictionless optimizer (994, Kt) is therefore found to be given by 



with 



W{x) - U{x) 



U'{x) (-\e^ 



AX"' 



AX^ 



+ E^[{^ + A^)'ASt]^ +0{e). 



Here, the first term also arises when computing the utility derived from (ipf, nf) traded at the mid 
price St rather than the shadow price St = St + ASt- Consequently, it measures the displace- 
ment loss incurred by deviating from the frictionless optimizer {ipt,Kt)- The second term in turn 
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represents the additional losses directly induced by the trading costs. Let us first focus on the 
displacement loss. Integration by parts and the definition of AXt yield 

AX^ = 2AXAv3 • St - 2AXAk • It + Aip^ • {S)t. 

Moreover, Ito's formula gives - i?o ^ = -R-"^ • Rt + R'^ • {R)t- Integrating by parts agam, 
inserting Ant = ^AXt + 0(e^/^), and using that St is a Q-martingale it follows that 



eQ 



Rt 



eQ [AX^ • iiy^ + R-^ • AXl + {R-'^,AX^)t 
AX^ „ AX^ ,„ _ 2AX2 



eQ 



2AXA(p 



{R,R)t 



fi2 



- • /t + • {S)t 



R^ 



{R,S)t +0{e). 



The argument of this expectation has Q-drift 
AX2 



1 c _ \ . /A(/j2 g 2AXA(f 



R 



R 



i?2 



It. 



To proceed, extend the "averaging" argument from Step 2 of Section |D] above by noticing that 
- at the leading order - all occurrences of the oscillatory part Aip^ of Acpt = Aip^ + Acp^ can 
be replaced in the above time integrals by their expectations under the uniform distribution on 
[— ANTf,ANTt] (compare I20| : also see Section |F] for more details). As a result the above 
Q-drift can be rewritten as 



AX' 



1 



it 



A^ s , (ANT)2 5 2AXA^ 



R 



3R 



i?2 



It + 0{e) 



2r] ' It + 



(ANT)^ 
3R 



'lT + 0{e), 



where we have used Aipf = ^ g AXf + 0(e2/3) to obtain the second equality. Taking into account 

Tefore follows that 

(ANT)2 



the BSDE ( |B.15[ ) for Rt, it therefore follows that 

AX2 



eQ 


"AX2" 











i?2 



It + 



3i? 



(S) 



T 



+ 0{e). 



The displacement loss is in turn given by 
1 



:eQ 



AX2 AX^ 
' It + 



i?2 



Rt 



eQ 



(ANT)2 
R 



{S) 



T 



+ 0(e). 



Next, consider the transaction cost loss E^[(p • ASt] + E^[Aip • ASt]- Since ASt = 0{e), we 
have 

if' ASt = nASt - ^oASo -AS'ift- {AS, ip)t = -{AS, ip)t + 0(e). 

. . '2 

Representation (D.l ) for ASt shows that —(AS", ip)t = {SaAip — 7) • + 0{e), so that it follows 
from the definition of at and jt as well as • It = {S)t that 



E^[^ • ASt] = 



Aip - (ANT)2 
R 



{Sh 



+ 0{e). 
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Again replacing - at the leading order 0{e^/^) - Ac^^ by its expectation (ANTt)^/3 under the 
uniform distribution on [— ANTt, ANT^] in the above time average, we obtain 



E'^lif • A5t] = — E'^ 



(ANT)2 



{S)t 



0{e). 



Finally, consider the second part E'^ [A(^ • A5't] of the transaction cost loss. It can be computed 
by integrating the Q-drift rate of A(/9 • AS't. Since the diffusion coefficient of A5't is of order 0(e^/'^) 



by (D.l), Girsanov's theorem shows that the latter coincides with the corresponding drift under 



the physical probability at the leading order 0(e^/^). In view of (D.2) and the definition of aj, the 

latter is given by • l^S)t + 0{e). Now, insert A^jj = A(^j + A(^j and once again approximate 

the oscillatory terms A^jj and Ay?^ by their expectations resp. (ANTt)^/3 under the uniform law 
on [-ANTt, ANTt]. Then: 



E^[A(/j • A5t] = -E^ 



(ANT)2 
R 



+ 0{e) 



As a consequence, the transaction cost loss E^[{ip + Aip) • ASt] + 0{e) is twice as large as the 
corresponding displacement loss at the leading order, and the total utility loss is given by 



U{x) - U%x) = U'{x)E^ 



(ANT)2 
2R 



{S)t 



0(6) 



Formula (3.4) for the certainty equivalent loss then follows by Taylor expansion. 



F Derivation of the Implied Trading Volume 

Consider an arbitrary diffusion strategy dcpt = bfdt + \J~(^dWt, implemented by performing the 
minimal amount of trading to keep the actual number (/>f of risky shares in a symmetric buffer 
[(pt — Af , (j)t + Af] with halfwidth Af = 0{e^), > 0, around (pf This means that trading of local 
time type occurs whenever the number 0f of risky shares reaches the moving boundaries (j)t ± Af , 
and the actual number of risky shares is the difference between the cumulative numbers of shares 
purchased resp. sold: = Lt — Ut- Put differently, the difference A</>t = (p^ — cpt = Lt — Ut — (pt 
between the actual position (pf and the target cpt is a diffusion reflected to remain in the small 
interval [— A|, A|] with width of order 0{e'^). To compute the turnover ||(/'^||t = L^ + Ut, we have 
to estimate the growth of the two local time processes, at the leading order for small e. 
To this end, rescale both time and space to obtain the rescaled process 

{Dr)r>0 = {e'''A(p^^2,.)r>0, 

which has drift b!f^ = e~^e'^^h'^ 2u = 0{e^) and squared diffusion coefficient = e~'^^ e^^ 0^-^,21, = 
0(1) in the interior of the no-trade region, where (pl is constant. Now, divide the interval [0,T] 
into a mesh = tg < • • • < ^Af= ~ ^ with mesh size of order 0{£'^), set rf = e~^'^tf, i = 0, . . . , N^, 
and decompose 

A</>T = e^E(^.f-I).^J. 
1=1 

As e becomes small, the time intervals [Tf_]^,rf] becomes longer and longer, whereas the reflecting 
barriers ibe~'^A^^2i. remain approximately constant on [T?_]^,r?]. At the leading order, each incre- 
ment — -Dt-e therefore corresponds to the one of a driftless Brownian motion with variance 

i i — 1 
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c^e £21/ > reflected to remain in the interval e A^^ ^a^, £2^]- As the time interval [Tf_]^,T?] 
grows, the corresponding local times therefore approach the long-run averages for reflected Brow- 
nian motion, which have been derived, e.g., in [23i Remark 4]. As a result, cumulative purchases 
and sales coincide - at the leading order 0{£~'^) for small transaction costs £t = e£t - and are both 
given by 



As the mesh size becomes small, they therefore converge to the integrals 



4Af J„ 4Af 

At the leading order £~'^, the corresponding absolute turnover is thus given by 



2Af 



(it. 



All these considerations hold for any strategy that remains close to a diffusion by means of reflection 
off two symmetric moving boundaries. The explicit representation for the approximately optimal 
strategy ff from Section [3.1| follows by inserting Formula (3.3) for maximal deviations Af, and 



taking into account that {tp + A(p)t = at the leading order 0(e^/^). 



G Mean- Variance Portfolio Selection 

In this section, we derive the results of Section [5] on mean- variance portfolio selection with small 
transaction costs. For the convenience of the reader, we first briefly recall the well-known frictionless 
case, and its connection to the maximization of (truncated) quadratic utility. 

G.l The Frictionless Case 

The Markowitz portfolio selection problem of minimizing the portfolio's variance Var[Xr(i/^)] for a 
given mean E[Xt{iP)] = m > x is equivalent to minimizing E[X^{tp)] — m? — A(£'[Xr('0)] — m), 
for a Lagrange multiplier A > such that the constraint E[Xt{iI^)] = m is satisfied. The optimal 
strategy in turn corresponds to the maximizer of 

E[-{Xt{iP)-\/2)\ 

that is, to the optimal strategy for the quadratic utility u{x) = —x"^ starting from initial endowment 
X — A/2. By the homotheticity of the quadratic utility maximization problem, the latter is given 
by the (A/2 — 2;)-fold of the optimal strategy 4)t for quadratic utility and the standardized initial 
endowment —1. Setting up this portfolio only requires an initial endowment of x — A/2; for the 
mean-variance efficient portfolio (pt^ the remaining endowment of A/2 is kept in the safe asset. 
Hence, its wealth process is given by 

Xriip) = A/2 + (A/2 - x){-l + • St). 

Now, let U{—1) = E[—{—l+4) • St)^] be the quadratic utility generated by the optimal portfolio 
for initial endowment —1. Then by, e.g., |7t Lemma 3.1.5] the latter coincides with the mean of 
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this portfolio: U{—1) = E[—l + (j) • St]- As a result, the Lagrange multiplier A is determined by 
the constraint m = = A/2 + (A/2 - x)C/(-l) as 

A _ m + xU{-l) 
2 ~ l + U{-l) ■ 

The corresponding minimal variance is in turn given by 

Var[XT(v^)] = - ^) (^[(-1 + 'Z' • St?] - E[-l + cj) • 5t]') 

\ 2 

m — X \ , _ 2n 



l + ?7(-l 
(m — x) 



-u{-i)-u{-iY) 



2 -f^(-i) 



1 + [/(-!) 

As a result, for any target mean m > x, the corresponding optimal Sharpe ratio is the same: 



^ E[XT{ip)] -X ^ / 

v/Var[XT(v?)] V f^(-l) 

In particular, this determines the maximal return for a given variance bound as 

E[XT{ip) -x] = sSR. 



G.2 Small Transaction Costs 

Let us now consider how the above results adapt to small proportional transaction costs £t = eSt- 
Maximizing expected quadratic utility is covered by the results of the previous sections, except for 
the fact that u{x) = —x^ is not increasing for wealth levels beyond the bliss point x = 0. However, 
for continuous asset prices, the optimal portfolio starting from initial wealth —1 never crosses the 
bliss point, cf., e.g., [H Lemma 3.7]; thereby, it is also optimal for the truncated quadratic utility 
— min(3;,0)^, which fits into our setting. Hence, the optimal strategy for the shadow price from 
Appendix D is also optimal with transaction costs. Indeed, it is optimal for quadratic utility and 
thereby also for the truncated version in the shadow market. Since the latter provides better terms 
of trade for any portfolio, but the same for the optimizer, the corresponding optimal portfolio is 
also optimal for the monotone truncated quadratic utility in the original market with transaction 
costs. But since the portfolio always remains below the bliss point x = 0, it is a fortiori also optimal 
for the quadratic utility with transaction costs. 

By the homotheticity of the quadratic utility maximization problem, the risk-tolerance process 
for u[x) = — is given by 

Rt = l-4>'St. 

In view of (3.3), the leading-order optimal strategy for quadratic utility and initial endowment — 1 
with transaction costs et therefore keeps the number of shares <j)f within a no-trade region centered 
around the midpoint NT? = + (ptiXf {(P^) - Xf ((/>)) = (/)tXf (0^)/Xt((/))Q with halfwidth 

* V 2 d{S)tJ • 

^^Here, Xt{(t>^) and Xt{4>) dentote the wealth processes generated by trading (j>t and <j>t, respectively, starting from 
initial endowment —1. 
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The mean-variance optimal portfolio iff satisfying the constraint E[Xj,{(f^)] = m is obtained by 
holding a cash position of and trading the (A''/2 — x)-fold of c^"^, where refers to the Lagrange 
multiplier A*" = 2(m + xC/^(— 1))/(1 + J7^(— 1)) for the shadow price Sf. As a result, the no-trade 
region for ipf is obtained by simply rescaling the one for cpf by a factor of (A^/2 — x). 

Let us now turn to the corresponding welfare effect of small transaction costs. For the quadratic 
utility u{x) = — and initial endowment —1, the latter is determined by Formula (3.4): 



U'{-1) ^ui-l-E^ 



(ANTf)^ 



2(1 



St) 



d{S)i 



2 1 



S, 



where the second step follows from the homotheticity of the quadratic utility maximization problem 
and Taylor expansion. As a result, the maximal Sharpe ratios in the presence of small transaction 
costs Et are given by 



SRf 



1 



-11 



^„ 1 + SR'^ „n 



T 



(ANTf: 



2(1 



(G.l) 



For a given target mean m, the corresponding minimal variance is increased due to small transaction 
costs to 



Yav[X^{ip')] = {m-x) 



-1) 



1 + U'{-1) 



Vav[XT{ip)] 1 + 



1 + U{-1 



^ (ANTf)2 



2(1 



S, 



-d{S)t 



Conversely, small transaction costs reduce the maximal expected return for a given variance bound 
from sSR to 



sSR^ 



SR 



1 + SR^ 
SR 



eQ 



T 



(ANTj* 



2(1 



St) 



d{S)t 



Now, consider the composition of the Sharpe ratio loss caused by small transaction costs. To 
this end, estimate the Sharpe ratio of the frictional optimizer ipf traded at the frictionless mid price 
St- First, notice that the Sharpe ratio is invariant among different multiples of the optimal strategy 



for quadratic utility: 

E[x + 



' St]-x E[(j)^ ' St] 



V^Var[x + e<i)^ • St] ^J\&T[(i)^ ' St] 



for any 9 G (0, oo). 



(G.2) 



In contrast, the corresponding expected quadratic utility E[—{—l + 6(p^ • St)'^] depends on the 
multiplier and is maximized for 9 = E[(j)^ • St]/E[{(I)'^ • St)'^]- Then, E[—l + 94)^ • St] = 
E[—{—l + 94>^ • •S'j')^], so that the corresponding Sharpe ratio is given by 



E[x + 9(P^ ' St]-x _ 1 
VVar[x + 94>'^^S^] ~ V ^[(-1 + Ocf)^ • StY^ 



By (G.2), this Sharpe ratio coincides with the one for the frictional optimizer ipf, since the latter 
is also obtained from (pf by rescaling. As the quadratic utility derived from iff is ~ by definition of 



9 - smaller than the one for 
E[x + ip'' • St]-x 



r, this implies that 



VVar[x + if^ • St] \ + ^ 



StY^ 



1 > 



E[{-1 + ' St)^] 



1. 



(G.3) 
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For the quadratic utility u{x) = —x'^, the computations in Appendix [e| show that the displacement 
loss, i.e., the difference between the optimal frictionless utility £'[—(—1 + cj) ' St)'^] and the per- 
formance E[—{—l + (j)^ • S'r)^] of the frictional optimizer traded at the mid price, amounts to one 
third of the total utility loss at the leading order. Together with (G.3) and Taylor expansion, this 
shows that the reduction of the Sharpe ratio due to displacement amounts to at most one third of 
the total Sharpe ratio loss in (G.l). 



H Long-Run Growth Optimality 

In this section, we argue why - without random endowment {^t = 0) ^ our approximate log- 
optimal portfolio not only maximizes the expected long-term growth rate, but also its almost sure 



counterpart (6.1), at the leading order for small costs. In addition, we also compute the leading- 



order reduction (6.2) of the growth rate due to small transaction costs. To this end, we proceed 
similarly as in Appendices [D] and |Ej but work directly in terms of relative quantities to deal with 
the double limit of small transaction costs and a long horizon. Recall that r]t = £t/St = eSt/St 
denotes the relative bid-ask spread, set 

= 3^^(i-t).y-. ' j ' 7* = 3at(A7rt)2, 

and let vrf be the risky weight corresponding to the minimal amount of trading necessary to remain 
in the no-trade region [nt — Avr^jTr^ + Avr^] around the frictionless target vrt. This means that the 
corresponding number (pi of risky shares is constant while the deviation Avrt = vrf — vrt lies in 
(AvTt, AvTt). With this notation, define 

AYt = at Ait t - 74 Avrt. 

Then by definition of a*, 7t, and Avrt, one readily verifies that AYt decreases from ijt to — r/^ as Ant 
moves from —Airt to Att^. Thus, 

St = St{l + AYt) 

is a valid candidate shadow price process, in that it takes values in the bid-ask spread and coincides 
with the ask resp. bid price whenever purchases resp. sales occur for the policy vrf . In particular, 
the frictional wealth process corresponding to the risky weight vrf coincides with its frictionless 
counterpart for Sf and is therefore given by the stochastic exponential X(o{^ • S^)t. Since the ratio 
St/ St is of the form 1 + 0(e), the deviation AYt is of order 0(e), and St = SQS'{Y)t = Sq + S • Yt, 
the definition of 5^ and integration by parts yield 

. SI = ' {St + AY ' St + S ' AYt + {S,AY)t) 

= vr^ • + vr^-^ • (Ay* + {Y, AY)t) = vr^ • {Yt + AYt + {Y, AY)t) + 0{eT). 
As a result, the frictional wealth process corresponding to vr^ is given b}|^ 

Xt{'K^) = x<^{Ti^ ' {Y + AY + {Y,AY))^^xe^^'^\ (H.l) 



Since we exclusively use relative quantities in this appendix, the arguments of wealth processes refer to risky 
weights rather than numbers of risky shares here. 
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For an arbitrary risky weight i^j, the corresponding expression provides an upper bound for the 
frictional wealth, because trades in the shadow market take place at potentially more favorable 
prices. As a result, for any risky weight "i?!, the ratio of wealth processes satisfies 



x^{i^') ^ • (y + AF + (y, Ay)))T 



X e 



(y + Ay + (y,Ay)))T 
[y + Ay + (y, Ay 



,0{eT) 



vr 



vr^ • (y + Ay) 



X e 



P{eT) 



where the second equality follows from Yor's formula as in [28, Lemma 3.4]. Like in the general 
case, it suffices to consider families of competitors (i?f )e>o converging to the frictionless optimizer 
TTt as e I 0, so that -i^f — 7rf = o(l). 

We now show that, by the respective definitions of at, 7t, and Attj, the process It + Al^ + 
(y, Ay)t — TT^ • (y + Ay)^ is a martingale up to a drift rate of order 0(e:^/^), so that the ratio 
of wealth processes is bounded from above by a martingale up to a multiplicative finite variation 
process Df of order e°('^^'^^^). To this end, we first compute the dynamics of Alj. Integration by 
parts shows that the risky weight 7rf has dynamics 



iflS{{l - vr^) • y + (1 - TT^)S/S^ • AY + finite variation terms)* 
iT%l - it") • y + TT%1 - 71^)8/3^ • Ay + finite variation terms. 



because the frictional number of risky shares 99^ is of finite variation. As a result, the martingale 
part of Attj = 7rf — ttj matches the one of 7r^(l — vr^) • y* + 7r^(l — 11^)3/8^ • Ay* — irt- Hence, using 
integration by parts and Ito's formula to write 



Ay* = 3aA7r • (Avr)* + (SaAvr - 7) • Avr* + Att • at - Att • + (SAvr 



a 



7, Att) 



shows that the diffusion coefficient of Al* is given by the (SatAiTt — 7()-fold of its counterpart for 
7r(l — vr) • y — vr, up to terms of order 0{e). In particular, it is of order 0(e^/^). The drift rate of 
Ay is in turn given by b^^ = 3atA^fCp + 0{e^/^) = SatAntCt^^'^^'^'"" + 0{e^/^). As a result. 



the drift rate of the process Yt + AYt 
order: 



(y, Ay)t — vr^ • (y + Ay)* indeed vanishes at the leading 



AY 



-(7rt + A7rt)(cf +2c 



SatA-KtC- 



7r(l-7r)«y-7r 



-Antcl + Oie^/^) = 0{e^/^), 



by definition of a* and because the frictionless growth-optimal portfolio is given by vrt = bj /cj . 
In summary, (il)^) / Xf {tt^) < DfMf for a martingale Mf and a finite variation process Df, both 
positive and starting at 1, and satisfying 



lim — log Dt 



o{e 



2/3N 



(H.2) 



Let us now argue why this implies the growth-optimality of the proposed policy, at the leading 
order for small costs. Here, the argument follows its frictionless counterpart |29l Theorem 3.10.1], 
up to taking care of the remainder terms in an appropriate manner. Fix 5 E (0, 1); then, Doob's 
maximal inequality (e.g., [16, Corollary 4.8 and Theorem 4.2]) implies 



^Snp 



sup 



ye 

ie[n, 00) 



> e 



5n 



sup Ml 

t£[n,oo) 



< E[M'^] = 1 
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for all n G N. As a consequence: 



n=l 



> 6 



n=l 



In view of the Borel-Cantelli lemma, this shows that P-a.s. there exists some (random) no G N such 
that for all n > no we have sup^g[„ - log (7r^)-Df — ^ ™^ hence 



1 ^ X 

'^^P 7^°g ye/ e\r,e 
ie[n,oo) i -^T\^^)^t 



< 6. 



This in turn yields that, P-a.s. 



hm sup ^ log Xj^{'&^) < lim sup log Xf,(7r^) + 5 + lim sup log Df^. 



Since 6 was arbitrary, combining this with (H.2) shows that the risky weight is indeed growth 
optimal at the leading order 0(e^/^). 

Let us now compute the leading-order reduction of the maximal long-run growth rate due to 
the presence of small transaction costs. To this end, consider the log-ratio of the optimal frictional 
and frictionless wealth processes. In view of (H.l), it can be written as 

= f + ^^'^ 



1 

f 
1 

f 



tt' • {Yt + AYr + {Y, AY)t) 



^-^.{Y + AY)T-7r'YT + - 



An* (Yt-tt* {Y)t) + vr • AYt - 

+ 7r(l-^) • {AY,Y)T)+0{e 



Air 



{Y)t + Avr • AYt 



(H.3) 



Here, we have used in the last step that the diffusion coefficient of AYt is of order 0(e^/^). The 
first term on the right-hand side of (H.3) is a martingale: the drift rate of — vr • {Y)t is given by 
— {bj /cY)cJ = 0. In view of the Dambis-Dubins-Schwarz theorem, it can therefore be written as 
a time-changed Brownian motion. Hence, its long-term average vanishes by the law of the iterated 
logarithm, provided that the local variance cY of the returns is not too far from stationary: 



lim sup — ( Avr • {Yt — tt • {Y)t] 



0. 



(H.4) 



As for the next term, recall AYt = 0(e). Then, integration by parts yields 

^ {tt . AYt) = ^ (tttAYt - vroAyo - AY - ttt - {AY, 7r)T) 



T 



{AY,7T)T + 0{e) 



^ (3a(A7r2 - a/)c"^^"''^*^""'^) * h + 0{e), 
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where we have used for the last step that the martingale part of AYj coincides with the one of 

(SqAtt — 7) • (7r(l — vr) ' Yt — vr^) at the order 0(e^/'^), and have also inserted the the definition of 

"ft. As in Appendices [El and IFI now approximate - at the leading order - the oscillatory deviation 
~ 2 — „ 

Att^ by its expectation Anf/S under the uniform law on [— AvrtjAvrt] in the above time average, 
obtaining 

^ (vr . AYt) = ^ ( aA^2^-(i-.).y-.,2. . j\ ^ Q^^y 



T 



For the third term on the right-hand side of (H.3) we use the same averaging argument: 

^ 2 \ 



1 / Avr 



T 



1 /Avr^ 

IT 



• It ) + 0(e) 



(H.5) 



(H.6) 



Next, consider the fourth term on the right-hand side of (H.3). As before, its martingale part does 
not contribute to the corresponding long-term average. For its drift part, inserting the representa- 
tion determined above gives 



1 / 



T V 



A7r6^ 



^y.I^)=l.( SaAT^'c^'-'--^-'^-- . It) + 0(e) = ^ (aATr^c^^-'^)-^- • It) + 0(e), 



T 



T 



where we have again approximated Attj by its expectation Att^/S under the uniform law on 
[—Ant, Ant]. In summary. 



limsup ^ (Att • AYt) = limsup ^ (aAn^c''^^'''^'^-'' • It) + 0(e). 



T-5-00 



T 



T-s>oo 



(H.7) 



Finally, let us turn to the last term on the right-hand side of (H.3). Inserting the leading-order 
martingale part of AYt, it follows that 

i (7r(l - vr) . {AY,Y)t) = ^ ({SaA/ - ^y<^--)-y--Mi-^)'Y . j\ + o{e) 



I (aA^2^-(i-)*^-'-2-(i-)*^ . It) + 0( 



(H.8) 

Here, we have used the definition of 7^ in the second step and also applied the above averaging 
argument one more time. Now, inserting (H.4-H.8) and the definition of at into (H.3) gives 



hm sup - , . = hm sup - 

T-S.00 -t ^T\J^) T-s>oo I 



lim sup 



-ac-(i--)*^-- -''—]An''lT]+ 0{e) 



1 Avr^ 
T 2 



{Y)T]+0{e). 



Provided all limits exist, this yields the desired formula for the reduction of the long-term growth 
rate caused by the presence of small transaction costs. 
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